Dataset statistics
| Number of variables | 35 |
|---|---|
| Number of observations | 1230000 |
| Missing cells | 1443926 |
| Missing cells (%) | 3.4% |
| Duplicate rows | 6763 |
| Duplicate rows (%) | 0.5% |
| Total size in memory | 370.1 MiB |
| Average record size in memory | 315.5 B |
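The derived figures in this table can be recomputed from the raw counts. A minimal Python check, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 MiB is 1024² bytes:

```python
# Raw counts copied from the "Dataset statistics" table.
n_variables = 35
n_observations = 1_230_000
missing_cells = 1_443_926
duplicate_rows = 6_763
total_size_mib = 370.1

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
# Assumption: MiB means 1024**2 bytes.
avg_record_bytes = total_size_mib * 1024**2 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The three printed values can be compared directly with the percentages and the average record size listed in the table.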
Variable types
| Categorical | 22 |
|---|---|
| Numeric | 12 |
| Boolean | 1 |
Alerts
| Alert | Type |
|---|---|
| FN has constant value "1.0" | Constant |
| Active has constant value "1.0" | Constant |
| Dataset has 6763 (0.5%) duplicate rows | Duplicates |
| customer_id has a high cardinality: 557593 distinct values | High cardinality |
| prod_name has a high cardinality: 38202 distinct values | High cardinality |
| product_type_name has a high cardinality: 127 distinct values | High cardinality |
| department_name has a high cardinality: 249 distinct values | High cardinality |
| section_name has a high cardinality: 56 distinct values | High cardinality |
| detail_desc has a high cardinality: 36009 distinct values | High cardinality |
| postal_code has a high cardinality: 254541 distinct values | High cardinality |
| article_id is highly correlated with product_code and 1 other field | High correlation |
| price is highly correlated with section_name | High correlation |
| product_code is highly correlated with article_id and 1 other field | High correlation |
| product_type_no is highly correlated with product_group_name and 6 other fields | High correlation |
| graphical_appearance_no is highly correlated with graphical_appearance_name and 3 other fields | High correlation |
| colour_group_code is highly correlated with graphical_appearance_name and 3 other fields | High correlation |
| perceived_colour_value_id is highly correlated with graphical_appearance_no and 5 other fields | High correlation |
| perceived_colour_master_id is highly correlated with graphical_appearance_name and 6 other fields | High correlation |
| department_no is highly correlated with product_type_no and 9 other fields | High correlation |
| section_no is highly correlated with product_group_name and 8 other fields | High correlation |
| garment_group_no is highly correlated with product_type_no and 10 other fields | High correlation |
| age is highly correlated with FN and 1 other field | High correlation |
| sales_channel_id is highly correlated with FN and 1 other field | High correlation |
| sale is highly correlated with FN and 1 other field | High correlation |
| product_group_name is highly correlated with product_type_no and 9 other fields | High correlation |
| graphical_appearance_name is highly correlated with product_group_name and 12 other fields | High correlation |
| colour_group_name is highly correlated with product_group_name and 11 other fields | High correlation |
| perceived_colour_value_name is highly correlated with graphical_appearance_no and 5 other fields | High correlation |
| perceived_colour_master_name is highly correlated with graphical_appearance_name and 6 other fields | High correlation |
| index_code is highly correlated with product_type_no and 11 other fields | High correlation |
| index_name is highly correlated with product_type_no and 11 other fields | High correlation |
| index_group_no is highly correlated with department_no and 7 other fields | High correlation |
| index_group_name is highly correlated with department_no and 7 other fields | High correlation |
| section_name is highly correlated with article_id and 16 other fields | High correlation |
| garment_group_name is highly correlated with product_type_no and 11 other fields | High correlation |
| FN is highly correlated with perceived_colour_master_name and 15 other fields | High correlation |
| Active is highly correlated with perceived_colour_master_name and 15 other fields | High correlation |
| club_member_status is highly correlated with FN and 1 other field | High correlation |
| fashion_news_frequency is highly correlated with FN and 1 other field | High correlation |
| FN has 708745 (57.6%) missing values | Missing |
| Active has 716695 (58.3%) missing values | Missing |
| graphical_appearance_no is highly skewed (γ1 = -60.75114289) | Skewed |
Reproduction
| Analysis started | 2022-11-10 15:56:10.585625 |
|---|---|
| Analysis finished | 2022-11-10 16:02:44.826929 |
| Duration | 6 minutes and 34.24 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
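The reported duration follows directly from the two timestamps. A short sketch with the standard `datetime` module, using the timestamps exactly as listed above:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2022-11-10 15:56:10.585625", FMT)
finished = datetime.strptime("2022-11-10 16:02:44.826929", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```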
customer_id
| Distinct | 557593 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
Length
| Max length | 64 |
|---|---|
| Median length | 64 |
| Mean length | 64 |
| Min length | 64 |
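Every value is exactly 64 characters long, and the character statistics for this variable report only 16 distinct characters, which is what a lowercase hexadecimal digest of 256 bits looks like. The sketch below checks those properties for a SHA-256 digest. That the identifiers were produced with SHA-256 is an assumption (the report does not say how they were generated), and the input string is made up for illustration.

```python
import hashlib
import string

# Hypothetical raw identifier; the real pre-hash values are not in the report.
digest = hashlib.sha256(b"example-customer").hexdigest()

HEX_LOWER = set(string.digits + "abcdef")  # the 16 possible characters

print(len(digest))               # length of one digest
print(set(digest) <= HEX_LOWER)  # True if only hex characters are used

# Fixed-length values make the total character count a simple product:
# 64 characters per value times the number of observations.
total_characters = 64 * 1_230_000
print(total_characters)
```

Because every value has the same length, the product of length and row count should equal the total character count the report gives for this variable.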
Characters and Unicode
| Total characters | 78720000 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 293373 |
|---|---|
| Unique (%) | 23.9% |
Sample
| 1st row | f05a521a2649a53841d0c5c837efb1d48e2eff7a6f6e47f94f0e21665d7adaa3 |
|---|---|
| 2nd row | 58afa373cb889cda30831ba3ca728bbb4147d5c1f3d19060f003bf5713d7f4f5 |
| 3rd row | 317ea97640e31f706565f2b61f17652ac569f05c1abc47fdf9fb2c4b446ca343 |
| 4th row | 6559a47c9760bc36d3f7a7497306daa1ea9ce4a3a340a0abfe07325b76f4cd1e |
| 5th row | 10292f992bbf7a999f8f2eee6c1b2de299ee1279e369223b73c8baf6d65fce21 |
Common Values
| Value | Count | Frequency (%) |
| b4db5e5259234574edfff958e170fe3a5e13b6f146752ca066abca3c156acc71 | 59 | < 0.1% |
| be1981ab818cf4ef6765b2ecaea7a2cbf14ccd6e8a7ee985513d9e8e53c6d91b | 59 | < 0.1% |
| 49beaacac0c7801c2ce2d189efe525fe80b5d37e46ed05b50a4cd88e34d0748f | 53 | < 0.1% |
| 8df45859ccd71ef1e48e2ee9d1c65d5728c31c46ae957d659fa4e5c3af6cc076 | 52 | < 0.1% |
| a65f77281a528bf5c1e9f270141d601d116e1df33bf9df512f495ee06647a9cc | 51 | < 0.1% |
| cd04ec2726dd58a8c753e0d6423e57716fd9ebcf2f14ed6012e7e5bea016b4d6 | 46 | < 0.1% |
| e6498c7514c61d3c24669f49753dc83fdff3ec1ba13902dd9184c959d8f0b249 | 46 | < 0.1% |
| c140410d72a41ee5e2e3ba3d7f5a860f337f1b5e41c27cf9bda5517c8774f8fa | 45 | < 0.1% |
| e97c3a6c680cd3569df10f901a61fdffaf8f70300f6adf6e266b80c87d54245a | 45 | < 0.1% |
| 6cc121e5cc202d2bf344ffe795002bdbf87178054bcda2e57161f0ef810a4b55 | 45 | < 0.1% |
| Other values (557583) | 1229499 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| b4db5e5259234574edfff958e170fe3a5e13b6f146752ca066abca3c156acc71 | 59 | < 0.1% |
| be1981ab818cf4ef6765b2ecaea7a2cbf14ccd6e8a7ee985513d9e8e53c6d91b | 59 | < 0.1% |
| 49beaacac0c7801c2ce2d189efe525fe80b5d37e46ed05b50a4cd88e34d0748f | 53 | < 0.1% |
| 8df45859ccd71ef1e48e2ee9d1c65d5728c31c46ae957d659fa4e5c3af6cc076 | 52 | < 0.1% |
| a65f77281a528bf5c1e9f270141d601d116e1df33bf9df512f495ee06647a9cc | 51 | < 0.1% |
| cd04ec2726dd58a8c753e0d6423e57716fd9ebcf2f14ed6012e7e5bea016b4d6 | 46 | < 0.1% |
| e6498c7514c61d3c24669f49753dc83fdff3ec1ba13902dd9184c959d8f0b249 | 46 | < 0.1% |
| c140410d72a41ee5e2e3ba3d7f5a860f337f1b5e41c27cf9bda5517c8774f8fa | 45 | < 0.1% |
| e97c3a6c680cd3569df10f901a61fdffaf8f70300f6adf6e266b80c87d54245a | 45 | < 0.1% |
| 6cc121e5cc202d2bf344ffe795002bdbf87178054bcda2e57161f0ef810a4b55 | 45 | < 0.1% |
| Other values (557583) | 1229499 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4926056 | 6.3% |
| 9 | 4925571 | 6.3% |
| a | 4924966 | 6.3% |
| e | 4924738 | 6.3% |
| 8 | 4924482 | 6.3% |
| 4 | 4920700 | 6.3% |
| 1 | 4920449 | 6.3% |
| f | 4920159 | 6.3% |
| 6 | 4919620 | 6.2% |
| 0 | 4919119 | 6.2% |
| Other values (6) | 29494140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49201514 | 62.5% |
| Lowercase Letter | 29518486 | 37.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4926056 | |
| 9 | 4925571 | |
| 8 | 4924482 | |
| 4 | 4920700 | |
| 1 | 4920449 | |
| 6 | 4919620 | |
| 0 | 4919119 | |
| 5 | 4917939 | |
| 3 | 4914812 | |
| 7 | 4912766 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4924966 | |
| e | 4924738 | |
| f | 4920159 | |
| c | 4918898 | |
| d | 4917989 | |
| b | 4911736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49201514 | 62.5% |
| Latin | 29518486 | 37.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4926056 | |
| 9 | 4925571 | |
| 8 | 4924482 | |
| 4 | 4920700 | |
| 1 | 4920449 | |
| 6 | 4919620 | |
| 0 | 4919119 | |
| 5 | 4917939 | |
| 3 | 4914812 | |
| 7 | 4912766 |
Latin
| Value | Count | Frequency (%) |
| a | 4924966 | |
| e | 4924738 | |
| f | 4920159 | |
| c | 4918898 | |
| d | 4917989 | |
| b | 4911736 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78720000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4926056 | 6.3% |
| 9 | 4925571 | 6.3% |
| a | 4924966 | 6.3% |
| e | 4924738 | 6.3% |
| 8 | 4924482 | 6.3% |
| 4 | 4920700 | 6.3% |
| 1 | 4920449 | 6.3% |
| f | 4920159 | 6.3% |
| 6 | 4919620 | 6.2% |
| 0 | 4919119 | 6.2% |
| Other values (6) | 29494140 |
| Distinct | 82328 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 696401650.3 |
| Minimum | 108775015 |
|---|---|
| Maximum | 956217002 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | 108775015 |
|---|---|
| 5-th percentile | 448515036 |
| Q1 | 632307011 |
| median | 714384002 |
| Q3 | 787028001 |
| 95-th percentile | 870970001 |
| Maximum | 956217002 |
| Range | 847441987 |
| Interquartile range (IQR) | 154720990 |
Descriptive statistics
| Standard deviation | 133245806.1 |
|---|---|
| Coefficient of variation (CV) | 0.1913347075 |
| Kurtosis | 2.527405494 |
| Mean | 696401650.3 |
| Median Absolute Deviation (MAD) | 77138000 |
| Skewness | -1.245928275 |
| Sum | 8.565740298 × 10^14 |
| Variance | 1.775444483 × 10^16 |
| Monotonicity | Not monotonic |
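Several of the descriptive statistics are simple functions of the others: the range is maximum minus minimum, the interquartile range is Q3 minus Q1, and the coefficient of variation is the standard deviation divided by the mean. A minimal check using the figures from the tables above:

```python
# Figures copied from the quantile and descriptive statistics tables.
minimum, maximum = 108_775_015, 956_217_002
q1, q3 = 632_307_011, 787_028_001
mean, std = 696_401_650.3, 133_245_806.1

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)

print(value_range, iqr, round(cv, 6))
```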
| Value | Count | Frequency (%) |
| 706016001 | 1919 | 0.2% |
| 706016002 | 1355 | 0.1% |
| 372860001 | 1216 | 0.1% |
| 610776002 | 1171 | 0.1% |
| 464297007 | 1012 | 0.1% |
| 759871002 | 975 | 0.1% |
| 372860002 | 956 | 0.1% |
| 610776001 | 866 | 0.1% |
| 399223001 | 847 | 0.1% |
| 720125001 | 820 | 0.1% |
| Other values (82318) | 1218863 |
| Value | Count | Frequency (%) |
| 108775015 | 408 | |
| 108775044 | 280 | |
| 108775051 | 11 | < 0.1% |
| 110065001 | 39 | < 0.1% |
| 110065002 | 18 | < 0.1% |
| 110065011 | 34 | < 0.1% |
| 111565001 | 181 | < 0.1% |
| 111565003 | 1 | < 0.1% |
| 111586001 | 537 | |
| 111593001 | 494 |
| Value | Count | Frequency (%) |
| 956217002 | 1 | < 0.1% |
| 953763001 | 1 | < 0.1% |
| 953450001 | 2 | < 0.1% |
| 949551002 | 6 | |
| 949551001 | 8 | |
| 949198001 | 3 | < 0.1% |
| 948152002 | 1 | < 0.1% |
| 948152001 | 1 | < 0.1% |
| 947934001 | 5 | |
| 947599001 | 1 | < 0.1% |
| Distinct | 25984 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02778704007 |
| Minimum | 0.0001355932203 |
|---|---|
| Maximum | 0.506779661 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | 0.0001355932203 |
|---|---|
| 5-th percentile | 0.007610169492 |
| Q1 | 0.01537288136 |
| median | 0.02540677966 |
| Q3 | 0.03388135593 |
| 95-th percentile | 0.05930508475 |
| Maximum | 0.506779661 |
| Range | 0.5066440678 |
| Interquartile range (IQR) | 0.01850847458 |
Descriptive statistics
| Standard deviation | 0.01935101504 |
|---|---|
| Coefficient of variation (CV) | 0.6964043306 |
| Kurtosis | 27.30249346 |
| Mean | 0.02778704007 |
| Median Absolute Deviation (MAD) | 0.008474576271 |
| Skewness | 3.219502551 |
| Sum | 34178.05928 |
| Variance | 0.0003744617829 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01693220339 | 128992 | 10.5% |
| 0.03388135593 | 128662 | 10.5% |
| 0.02540677966 | 123167 | 10.0% |
| 0.01354237288 | 56874 | 4.6% |
| 0.0423559322 | 56642 | 4.6% |
| 0.05083050847 | 56234 | 4.6% |
| 0.02201694915 | 49025 | 4.0% |
| 0.03049152542 | 46119 | 3.7% |
| 0.008457627119 | 40769 | 3.3% |
| 0.01523728814 | 26495 | 2.2% |
| Other values (25974) | 517021 |
| Value | Count | Frequency (%) |
| 0.0001355932203 | 1 | < 0.1% |
| 0.0002372881356 | 1 | < 0.1% |
| 0.0003220338983 | 2 | < 0.1% |
| 0.0003559322034 | 2 | < 0.1% |
| 0.0003728813559 | 1 | < 0.1% |
| 0.0003898305085 | 1 | < 0.1% |
| 0.0004237288136 | 17 | |
| 0.0004406779661 | 2 | < 0.1% |
| 0.0004576271186 | 2 | < 0.1% |
| 0.0004915254237 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.506779661 | 4 | < 0.1% |
| 0.501220339 | 1 | < 0.1% |
| 0.4919988701 | 1 | < 0.1% |
| 0.4346975914 | 1 | < 0.1% |
| 0.4220338983 | 29 | |
| 0.4177966102 | 1 | < 0.1% |
| 0.4147941889 | 1 | < 0.1% |
| 0.4133903441 | 1 | < 0.1% |
| 0.4093220339 | 1 | < 0.1% |
| 0.4088895383 | 1 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1230000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 868239 | 70.6% |
| 1 | 361761 | 29.4% |
Length (histogram omitted)
Category Frequency Plot (plot omitted)
Words
| Value | Count | Frequency (%) |
| 2 | 868239 | |
| 1 | 361761 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 868239 | |
| 1 | 361761 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1230000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 868239 | |
| 1 | 361761 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1230000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 868239 | |
| 1 | 361761 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1230000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 868239 | |
| 1 | 361761 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.6 MiB |
| Value | Count | Frequency (%) |
| True | 1200000 | 97.6% |
| False | 30000 | 2.4% |
| Distinct | 39034 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 696401.6437 |
| Minimum | 108775 |
|---|---|
| Maximum | 956217 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | 108775 |
|---|---|
| 5-th percentile | 448515 |
| Q1 | 632307 |
| median | 714384 |
| Q3 | 787028 |
| 95-th percentile | 870970 |
| Maximum | 956217 |
| Range | 847442 |
| Interquartile range (IQR) | 154721 |
Descriptive statistics
| Standard deviation | 133245.8094 |
|---|---|
| Coefficient of variation (CV) | 0.1913347141 |
| Kurtosis | 2.527405227 |
| Mean | 696401.6437 |
| Median Absolute Deviation (MAD) | 77138 |
| Skewness | -1.245928234 |
| Sum | 8.565740218 × 10^11 |
| Variance | 1.775444572 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 706016 | 7006 | 0.6% |
| 562245 | 6015 | 0.5% |
| 610776 | 5279 | 0.4% |
| 599580 | 4578 | 0.4% |
| 717490 | 3082 | 0.3% |
| 695632 | 2881 | 0.2% |
| 372860 | 2833 | 0.2% |
| 684209 | 2786 | 0.2% |
| 759871 | 2592 | 0.2% |
| 688537 | 2552 | 0.2% |
| Other values (39024) | 1190396 |
| Value | Count | Frequency (%) |
| 108775 | 699 | |
| 110065 | 91 | < 0.1% |
| 111565 | 182 | < 0.1% |
| 111586 | 537 | |
| 111593 | 494 | |
| 111609 | 130 | < 0.1% |
| 114428 | 3 | < 0.1% |
| 116379 | 6 | < 0.1% |
| 118458 | 33 | < 0.1% |
| 120129 | 252 | < 0.1% |
| Value | Count | Frequency (%) |
| 956217 | 1 | < 0.1% |
| 953763 | 1 | < 0.1% |
| 953450 | 2 | < 0.1% |
| 949551 | 14 | |
| 949198 | 3 | < 0.1% |
| 948152 | 2 | < 0.1% |
| 947934 | 5 | < 0.1% |
| 947599 | 1 | < 0.1% |
| 947509 | 9 | |
| 947060 | 5 | < 0.1% |
prod_name
| Distinct | 38202 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 15.41491301 |
| Min length | 1 |
Characters and Unicode
| Total characters | 18960343 |
|---|---|
| Distinct characters | 90 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 6057 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Hazelnut Push Melbourne |
|---|---|
| 2nd row | Rachel |
| 3rd row | Bonina loose tank |
| 4th row | Edit fancy dress |
| 5th row | Lady Di |
Common Values
| Value | Count | Frequency (%) |
| Jade HW Skinny Denim TRS | 6423 | 0.5% |
| Luna skinny RW | 5448 | 0.4% |
| Timeless Midrise Brief | 4578 | 0.4% |
| Tilly (1) | 4036 | 0.3% |
| Cat Tee. | 3082 | 0.3% |
| Shake it in Balconette | 2788 | 0.2% |
| Simple as That Triangle Top | 2786 | 0.2% |
| Tilda tank | 2592 | 0.2% |
| Simple as that Cheeky Tanga | 2552 | 0.2% |
| Despacito | 2550 | 0.2% |
| Other values (38192) | 1193165 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| dress | 76799 | 2.3% |
| top | 68992 | 2.0% |
| hw | 54778 | 1.6% |
| 1 | 43668 | 1.3% |
| tee | 43078 | 1.3% |
| skinny | 41934 | 1.2% |
| denim | 33463 | 1.0% |
| shorts | 31757 | 0.9% |
| trs | 31748 | 0.9% |
| push | 30582 | 0.9% |
| Other values (12375) | 2916588 |
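The table above counts individual words rather than whole values. The exact tokenizer used by the report is not shown here; the sketch below assumes lowercasing, splitting on whitespace, and stripping surrounding punctuation, which is consistent with entries such as `hw`, `trs`, and `1` (as in "Tilly (1)"). The function name `word_counts` is hypothetical.

```python
import string
from collections import Counter

def word_counts(values):
    """Count lowercase words across values (assumed tokenization)."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            word = token.strip(string.punctuation)
            if word:  # skip tokens that consisted of punctuation only
                counts[word] += 1
    return counts

# A few product names taken from the Common Values table.
names = ["Jade HW Skinny Denim TRS", "Luna skinny RW", "Tilly (1)", "Cat Tee."]
counts = word_counts(names)
print(counts["skinny"], counts["1"], counts["tee"])
```

Under these assumptions "Skinny" and "skinny" fall into the same bucket, which matches the single lowercase `skinny` row in the table.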
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2149171 | 11.3% |
| e | 1430730 | 7.5% |
| a | 1220693 | 6.4% |
| i | 1044605 | 5.5% |
| s | 909621 | 4.8% |
| r | 897181 | 4.7% |
| n | 851277 | 4.5% |
| o | 760560 | 4.0% |
| t | 743435 | 3.9% |
| l | 738327 | 3.9% |
| Other values (80) | 8214743 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12116310 | |
| Uppercase Letter | 4169298 | 22.0% |
| Space Separator | 2149171 | 11.3% |
| Decimal Number | 203192 | 1.1% |
| Other Punctuation | 92692 | 0.5% |
| Open Punctuation | 81481 | 0.4% |
| Close Punctuation | 81172 | 0.4% |
| Dash Punctuation | 59682 | 0.3% |
| Math Symbol | 5632 | < 0.1% |
| Modifier Symbol | 1213 | < 0.1% |
| Other values (2) | 500 | < 0.1% |
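The category names in this table are Unicode general categories. A minimal sketch using Python's standard `unicodedata` module shows how individual characters map to them. The two-letter codes (`Lu`, `Ll`, `Zs`, and so on) are the standard abbreviations; the mapping to the long names used in the report is spelled out by hand here, and `category_counts` is a hypothetical helper, not part of the profiling tool.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this section.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
    "Sk": "Modifier Symbol",
    "Pc": "Connector Punctuation",
    "So": "Other Symbol",
}

def category_counts(text):
    """Count the characters of `text` per Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}

print(category_counts("Cat Tee."))
print(category_counts("Tilly (1)"))
```

Applying the same count to every character of every value would give per-category totals like the ones tabulated above.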
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1430730 | |
| a | 1220693 | 10.1% |
| i | 1044605 | 8.6% |
| s | 909621 | 7.5% |
| r | 897181 | 7.4% |
| n | 851277 | 7.0% |
| o | 760560 | 6.3% |
| t | 743435 | 6.1% |
| l | 738327 | 6.1% |
| p | 416862 | 3.4% |
| Other values (23) | 3103019 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 479587 | 11.5% |
| T | 356434 | 8.5% |
| R | 257187 | 6.2% |
| E | 251058 | 6.0% |
| L | 248650 | 6.0% |
| P | 248608 | 6.0% |
| B | 219129 | 5.3% |
| A | 216370 | 5.2% |
| C | 214556 | 5.1% |
| M | 182256 | 4.4% |
| Other values (22) | 1495463 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 76151 | |
| 2 | 39838 | |
| 3 | 24080 | 11.9% |
| 5 | 16070 | 7.9% |
| 9 | 13242 | 6.5% |
| 0 | 13046 | 6.4% |
| 7 | 8833 | 4.3% |
| 4 | 7555 | 3.7% |
| 8 | 3000 | 1.5% |
| 6 | 1377 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 69853 | |
| / | 14266 | 15.4% |
| & | 5447 | 5.9% |
| ! | 1801 | 1.9% |
| : | 935 | 1.0% |
| ' | 344 | 0.4% |
| ? | 46 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2149171 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 81481 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 81172 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 59682 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5632 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 1213 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 499 |
Other Symbol
| Value | Count | Frequency (%) |
| © | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16285608 | |
| Common | 2674735 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1430730 | 8.8% |
| a | 1220693 | 7.5% |
| i | 1044605 | 6.4% |
| s | 909621 | 5.6% |
| r | 897181 | 5.5% |
| n | 851277 | 5.2% |
| o | 760560 | 4.7% |
| t | 743435 | 4.6% |
| l | 738327 | 4.5% |
| S | 479587 | 2.9% |
| Other values (55) | 7209592 |
Common
| Value | Count | Frequency (%) |
| (space) | 2149171 | |
| ( | 81481 | 3.0% |
| ) | 81172 | 3.0% |
| 1 | 76151 | 2.8% |
| . | 69853 | 2.6% |
| - | 59682 | 2.2% |
| 2 | 39838 | 1.5% |
| 3 | 24080 | 0.9% |
| 5 | 16070 | 0.6% |
| / | 14266 | 0.5% |
| Other values (15) | 62971 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18959435 | |
| None | 908 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2149171 | 11.3% |
| e | 1430730 | 7.5% |
| a | 1220693 | 6.4% |
| i | 1044605 | 5.5% |
| s | 909621 | 4.8% |
| r | 897181 | 4.7% |
| n | 851277 | 4.5% |
| o | 760560 | 4.0% |
| t | 743435 | 3.9% |
| l | 738327 | 3.9% |
| Other values (66) | 8213835 |
None
| Value | Count | Frequency (%) |
| ö | 213 | |
| é | 209 | |
| è | 138 | |
| Ä | 99 | |
| Ö | 54 | 5.9% |
| í | 52 | 5.7% |
| ë | 43 | 4.7% |
| å | 34 | 3.7% |
| É | 32 | 3.5% |
| ä | 20 | 2.2% |
| Other values (4) | 14 | 1.5% |
product_type_no
| Distinct | 128 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 245.4353106 |
| Minimum | -1 |
|---|---|
| Maximum | 762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3597 |
| Negative (%) | 0.3% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 253 |
| median | 264 |
| Q3 | 273 |
| 95-th percentile | 306 |
| Maximum | 762 |
| Range | 763 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 68.5649378 |
|---|---|
| Coefficient of variation (CV) | 0.2793605274 |
| Kurtosis | 3.221356815 |
| Mean | 245.4353106 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -1.843928393 |
| Sum | 301885432 |
| Variance | 4701.150696 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 272 | 161142 | 13.1% |
| 265 | 123984 | 10.1% |
| 252 | 107245 | 8.7% |
| 255 | 85819 | 7.0% |
| 254 | 60523 | 4.9% |
| 258 | 57742 | 4.7% |
| 253 | 53568 | 4.4% |
| 306 | 50835 | 4.1% |
| 274 | 44651 | 3.6% |
| 298 | 42698 | 3.5% |
| Other values (118) | 441793 |
| Value | Count | Frequency (%) |
| -1 | 3597 | 0.3% |
| 49 | 37 | < 0.1% |
| 57 | 10915 | 0.9% |
| 59 | 41813 | |
| 60 | 25 | < 0.1% |
| 66 | 9148 | 0.7% |
| 67 | 7821 | 0.6% |
| 68 | 512 | < 0.1% |
| 69 | 1355 | 0.1% |
| 70 | 7659 | 0.6% |
| Value | Count | Frequency (%) |
| 762 | 3 | < 0.1% |
| 761 | 6 | < 0.1% |
| 532 | 133 | < 0.1% |
| 529 | 74 | < 0.1% |
| 525 | 4 | < 0.1% |
| 523 | 15 | < 0.1% |
| 521 | 13 | < 0.1% |
| 515 | 53 | < 0.1% |
| 514 | 4 | < 0.1% |
| 512 | 1725 |
product_type_name
| Distinct | 127 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 7.497685366 |
| Min length | 3 |
Characters and Unicode
| Total characters | 9222153 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bra |
|---|---|
| 2nd row | Sweater |
| 3rd row | Vest top |
| 4th row | Dress |
| 5th row | Sweater |
Common Values
| Value | Count | Frequency (%) |
| Trousers | 161142 | 13.1% |
| Dress | 123984 | 10.1% |
| Sweater | 107245 | 8.7% |
| T-shirt | 85819 | 7.0% |
| Top | 60523 | 4.9% |
| Blouse | 57742 | 4.7% |
| Vest top | 53568 | 4.4% |
| Bra | 50835 | 4.1% |
| Shorts | 44651 | 3.6% |
| Bikini top | 42698 | 3.5% |
| Other values (117) | 441793 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| trousers | 161470 | 11.0% |
| top | 157194 | 10.7% |
| dress | 123984 | 8.5% |
| sweater | 107245 | 7.3% |
| bottom | 86720 | 5.9% |
| t-shirt | 85819 | 5.9% |
| blouse | 57742 | 3.9% |
| vest | 53568 | 3.7% |
| underwear | 53077 | 3.6% |
| bra | 50886 | 3.5% |
| Other values (135) | 526067 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1046926 | 11.4% |
| s | 1011179 | 11.0% |
| e | 953284 | 10.3% |
| t | 782260 | 8.5% |
| o | 709446 | 7.7% |
| i | 516494 | 5.6% |
| a | 464583 | 5.0% |
| T | 344145 | 3.7% |
| S | 317223 | 3.4% |
| u | 273526 | 3.0% |
| Other values (41) | 2803087 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7575875 | |
| Uppercase Letter | 1278362 | 13.9% |
| Space Separator | 233772 | 2.5% |
| Dash Punctuation | 85860 | 0.9% |
| Other Punctuation | 48284 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1046926 | |
| s | 1011179 | |
| e | 953284 | |
| t | 782260 | |
| o | 709446 | |
| i | 516494 | 6.8% |
| a | 464583 | 6.1% |
| u | 273526 | 3.6% |
| w | 261929 | 3.5% |
| h | 211075 | 2.8% |
| Other values (15) | 1345173 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 344145 | |
| S | 317223 | |
| B | 201445 | |
| D | 125547 | 9.8% |
| U | 57232 | 4.5% |
| V | 53568 | 4.2% |
| H | 36138 | 2.8% |
| J | 32893 | 2.6% |
| L | 28350 | 2.2% |
| P | 25950 | 2.0% |
| Other values (12) | 55871 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 48280 | |
| . | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 233772 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 85860 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8854237 | |
| Common | 367916 | 4.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 1046926 | |
| s | 1011179 | |
| e | 953284 | 10.8% |
| t | 782260 | 8.8% |
| o | 709446 | 8.0% |
| i | 516494 | 5.8% |
| a | 464583 | 5.2% |
| T | 344145 | 3.9% |
| S | 317223 | 3.6% |
| u | 273526 | 3.1% |
| Other values (37) | 2435171 |
Common
| Value | Count | Frequency (%) |
| (space) | 233772 | |
| - | 85860 | 23.3% |
| / | 48280 | 13.1% |
| . | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9222153 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 1046926 | 11.4% |
| s | 1011179 | 11.0% |
| e | 953284 | 10.3% |
| t | 782260 | 8.5% |
| o | 709446 | 7.7% |
| i | 516494 | 5.6% |
| a | 464583 | 5.0% |
| T | 344145 | 3.7% |
| S | 317223 | 3.4% |
| u | 273526 | 3.0% |
| Other values (41) | 2803087 |
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 15.44760976 |
| Min length | 3 |
Characters and Unicode
| Total characters | 19000560 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Underwear |
|---|---|
| 2nd row | Garment Upper body |
| 3rd row | Garment Upper body |
| 4th row | Garment Full body |
| 5th row | Garment Upper body |
Common Values
| Value | Count | Frequency (%) |
| Garment Upper body | 484557 | |
| Garment Lower body | 270415 | |
| Garment Full body | 136904 | 11.1% |
| Underwear | 98277 | 8.0% |
| Swimwear | 97859 | 8.0% |
| Accessories | 67039 | 5.5% |
| Shoes | 30091 | 2.4% |
| Socks & Tights | 26763 | 2.2% |
| Nightwear | 13812 | 1.1% |
| Unknown | 3597 | 0.3% |
| Other values (9) | 686 | 0.1% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| garment | 891887 | |
| body | 891876 | |
| upper | 484557 | |
| lower | 270415 | 8.8% |
| full | 136904 | 4.5% |
| underwear | 98277 | 3.2% |
| swimwear | 97859 | 3.2% |
| accessories | 67039 | 2.2% |
| shoes | 30091 | 1.0% |
| socks | 26763 | 0.9% |
| Other values (16) | 71646 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2119752 | 11.2% |
| r | 2022312 | 10.6% |
| (space) | 1837314 | 9.7% |
| o | 1289878 | 6.8% |
| a | 1102233 | 5.8% |
| n | 1001081 | 5.3% |
| d | 990206 | 5.2% |
| m | 990062 | 5.2% |
| p | 969114 | 5.1% |
| t | 932861 | 4.9% |
| Other values (25) | 5745747 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14987791 | |
| Uppercase Letter | 2148650 | 11.3% |
| Space Separator | 1837314 | 9.7% |
| Other Punctuation | 26805 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2119752 | |
| r | 2022312 | |
| o | 1289878 | |
| a | 1102233 | |
| n | 1001081 | 6.7% |
| d | 990206 | 6.6% |
| m | 990062 | 6.6% |
| p | 969114 | 6.5% |
| t | 932861 | 6.2% |
| y | 891882 | 6.0% |
| Other values (11) | 2678410 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 891887 | |
| U | 586473 | |
| L | 270415 | 12.6% |
| S | 154730 | 7.2% |
| F | 136926 | 6.4% |
| A | 67039 | 3.1% |
| T | 26763 | 1.2% |
| N | 13812 | 0.6% |
| B | 286 | < 0.1% |
| I | 242 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 26763 | |
| / | 42 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1837314 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17136441 | |
| Common | 1864119 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2119752 | |
| r | 2022312 | |
| o | 1289878 | 7.5% |
| a | 1102233 | 6.4% |
| n | 1001081 | 5.8% |
| d | 990206 | 5.8% |
| m | 990062 | 5.8% |
| p | 969114 | 5.7% |
| t | 932861 | 5.4% |
| G | 891887 | 5.2% |
| Other values (22) | 4827055 |
Common
| Value | Count | Frequency (%) |
| (space) | 1837314 | |
| & | 26763 | 1.4% |
| / | 42 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19000560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2119752 | 11.2% |
| r | 2022312 | 10.6% |
| (space) | 1837314 | 9.7% |
| o | 1289878 | 6.8% |
| a | 1102233 | 5.8% |
| n | 1001081 | 5.3% |
| d | 990206 | 5.2% |
| m | 990062 | 5.2% |
| p | 969114 | 5.1% |
| t | 932861 | 4.9% |
| Other values (25) | 5745747 |
graphical_appearance_no
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1009740.487 |
| Minimum | -1 |
|---|---|
| Maximum | 1010029 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 333 |
| Negative (%) | < 0.1% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1010001 |
| Q1 | 1010010 |
| median | 1010016 |
| Q3 | 1010016 |
| 95-th percentile | 1010023 |
| Maximum | 1010029 |
| Range | 1010030 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 16616.46993 |
|---|---|
| Coefficient of variation (CV) | 0.01645617874 |
| Kurtosis | 3688.707895 |
| Mean | 1009740.487 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -60.75114289 |
| Sum | 1.241980799 × 10^12 |
| Variance | 276107072.9 |
| Monotonicity | Not monotonic |
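The extreme skewness is consistent with the 333 rows that hold the value -1 while every other value lies between 1010001 and 1010029. As a rough check, the sketch below collapses the column to a two-point distribution (the -1 rows versus everything else) and evaluates the closed-form skewness of that distribution. Ignoring the small spread among the remaining codes is a simplifying assumption, not how the report computes γ1.

```python
import math

n_rows = 1_230_000
n_low = 333  # rows where the value is -1

p = n_low / n_rows  # share of rows at the low point

# Skewness of a two-point distribution with mass p on the lower value and
# 1 - p on the higher value; it does not depend on the values themselves.
skew_two_point = (2 * p - 1) / math.sqrt(p * (1 - p))

print(round(skew_two_point, 2))
```

The result is close to the reported skewness, which suggests that the alert is driven by the -1 rows rather than by the distribution of the regular codes.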
| Value | Count | Frequency (%) |
| 1010016 | 688101 | |
| 1010001 | 154917 | 12.6% |
| 1010023 | 75193 | 6.1% |
| 1010010 | 73541 | 6.0% |
| 1010017 | 56140 | 4.6% |
| 1010026 | 28222 | 2.3% |
| 1010004 | 22874 | 1.9% |
| 1010021 | 22530 | 1.8% |
| 1010014 | 17410 | 1.4% |
| 1010008 | 13114 | 1.1% |
| Other values (20) | 77958 | 6.3% |
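The Frequency (%) column appears to be each count divided by the number of observations (1230000, from the dataset overview), rounded to one decimal place, with shares that round to 0.0 shown as `< 0.1%`. The sketch below reproduces that rule as inferred from the rows of these tables; `format_share` is a hypothetical helper, not a function of any profiling library.

```python
N_OBSERVATIONS = 1_230_000  # from the dataset overview


def format_share(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format count/total like the Frequency (%) column (inferred rule)."""
    text = f"{100 * count / total:.1f}"
    # Non-zero counts whose share rounds to 0.0 are shown as "< 0.1%".
    if text == "0.0" and count > 0:
        return "< 0.1%"
    return f"{text}%"


for count in (688_101, 154_917, 869, 333):
    print(count, format_share(count))
```

The same helper can fill a percentage cell that is blank in the export, provided the correct denominator is used (observations for value tables, total characters or words for the character and word tables).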
| Value | Count | Frequency (%) |
| -1 | 333 | < 0.1% |
| 1010001 | 154917 | 12.6% |
| 1010002 | 5721 | 0.5% |
| 1010003 | 45 | < 0.1% |
| 1010004 | 22874 | 1.9% |
| 1010005 | 9611 | 0.8% |
| 1010006 | 9299 | 0.8% |
| 1010007 | 12241 | 1.0% |
| 1010008 | 13114 | 1.1% |
| 1010009 | 7160 | 0.6% |
| Value | Count | Frequency (%) |
| 1010029 | 13 | < 0.1% |
| 1010028 | 1541 | 0.1% |
| 1010027 | 1144 | 0.1% |
| 1010026 | 28222 | 2.3% |
| 1010025 | 869 | 0.1% |
| 1010024 | 1387 | 0.1% |
| 1010023 | 75193 | 6.1% |
| 1010022 | 6807 | 0.6% |
| 1010021 | 22530 | 1.8% |
| 1010020 | 6694 | 0.5% |
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Solid | 688101 |
|---|---|
| All over pattern | 154917 |
| Denim | 75193 |
| Melange | 73541 |
| Stripe | 56140 |
| Other values (25) | 182108 |
Length
| Max length | 19 |
|---|---|
| Median length | 5 |
| Mean length | 7.308760163 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8989775 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Solid |
|---|---|
| 2nd row | Melange |
| 3rd row | Solid |
| 4th row | All over pattern |
| 5th row | Solid |
Common Values
| Value | Count | Frequency (%) |
| Solid | 688101 | 55.9% |
| All over pattern | 154917 | 12.6% |
| Denim | 75193 | 6.1% |
| Melange | 73541 | 6.0% |
| Stripe | 56140 | 4.6% |
| Other structure | 28222 | 2.3% |
| Check | 22874 | 1.9% |
| Lace | 22530 | 1.8% |
| Placement print | 17410 | 1.4% |
| Front print | 13114 | 1.1% |
| Other values (20) | 77958 | 6.3% |
Length
| Value | Count | Frequency (%) |
| solid | 688101 | 42.7% |
| pattern | 156618 | 9.7% |
| over | 154917 | 9.6% |
| all | 154917 | 9.6% |
| denim | 75193 | 4.7% |
| melange | 73541 | 4.6% |
| stripe | 56140 | 3.5% |
| | 30524 | 1.9% |
| other | 29923 | 1.9% |
| structure | 28222 | 1.7% |
| Other values (25) | 164805 | 10.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1144227 | 12.7% |
| o | 922288 | 10.3% |
| i | 916076 | 10.2% |
| e | 778360 | 8.7% |
| S | 747945 | 8.3% |
| d | 713167 | 7.9% |
| t | 584594 | 6.5% |
| r | 562653 | 6.3% |
| n | 408194 | 4.5% |
| (space) | 382901 | 4.3% |
| Other values (32) | 1829370 | 20.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7342382 | 81.7% |
| Uppercase Letter | 1242881 | 13.8% |
| Space Separator | 382901 | 4.3% |
| Other Punctuation | 15890 | 0.2% |
| Decimal Number | 5721 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1144227 | |
| o | 922288 | |
| i | 916076 | |
| e | 778360 | |
| d | 713167 | |
| t | 584594 | |
| r | 562653 | |
| n | 408194 | 5.6% |
| a | 317340 | 4.3% |
| p | 259585 | 3.5% |
| Other values (13) | 735898 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 747945 | |
| A | 160683 | 12.9% |
| D | 90213 | 7.3% |
| M | 87742 | 7.1% |
| C | 40566 | 3.3% |
| O | 29923 | 2.4% |
| L | 22530 | 1.8% |
| P | 17410 | 1.4% |
| F | 13114 | 1.1% |
| E | 12241 | 1.0% |
| Other values (6) | 20514 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 382901 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 15890 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 5721 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8585263 | 95.5% |
| Common | 404512 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1144227 | |
| o | 922288 | |
| i | 916076 | |
| e | 778360 | |
| S | 747945 | |
| d | 713167 | |
| t | 584594 | 6.8% |
| r | 562653 | 6.6% |
| n | 408194 | 4.8% |
| a | 317340 | 3.7% |
| Other values (29) | 1490419 |
Common
| Value | Count | Frequency (%) |
| (space) | 382901 | 94.7% |
| / | 15890 | 3.9% |
| 3 | 5721 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8989775 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1144227 | 12.7% |
| o | 922288 | 10.3% |
| i | 916076 | 10.2% |
| e | 778360 | 8.7% |
| S | 747945 | 8.3% |
| d | 713167 | 7.9% |
| t | 584594 | 6.5% |
| r | 562653 | 6.3% |
| n | 408194 | 4.5% |
| (space) | 382901 | 4.3% |
| Other values (32) | 1829370 | 20.3% |
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.40332114 |
| Minimum | -1 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 207 |
| Negative (%) | < 0.1% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 9 |
| median | 10 |
| Q3 | 43 |
| 95-th percentile | 73 |
| Maximum | 93 |
| Range | 94 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 26.21933171 |
|---|---|
| Coefficient of variation (CV) | 0.99303158 |
| Kurtosis | -0.07699996051 |
| Mean | 26.40332114 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.206646349 |
| Sum | 32476085 |
| Variance | 687.4533552 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 421724 | 34.3% |
| 10 | 129229 | 10.5% |
| 73 | 84476 | 6.9% |
| 12 | 48520 | 3.9% |
| 72 | 41798 | 3.4% |
| 13 | 35672 | 2.9% |
| 71 | 35155 | 2.9% |
| 51 | 34866 | 2.8% |
| 7 | 33660 | 2.7% |
| 11 | 32689 | 2.7% |
| Other values (40) | 332211 | 27.0% |
| Value | Count | Frequency (%) |
| -1 | 207 | < 0.1% |
| 1 | 853 | 0.1% |
| 2 | 362 | < 0.1% |
| 3 | 4917 | 0.4% |
| 4 | 424 | < 0.1% |
| 5 | 10488 | 0.9% |
| 6 | 15991 | 1.3% |
| 7 | 33660 | 2.7% |
| 8 | 27883 | 2.3% |
| 9 | 421724 | 34.3% |
| Value | Count | Frequency (%) |
| 93 | 27365 | 2.2% |
| 92 | 7297 | 0.6% |
| 91 | 4807 | 0.4% |
| 90 | 618 | 0.1% |
| 83 | 4588 | 0.4% |
| 82 | 2888 | 0.2% |
| 81 | 3086 | 0.3% |
| 80 | 26 | < 0.1% |
| 73 | 84476 | 6.9% |
| 72 | 41798 | 3.4% |
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Black | 421724 |
|---|---|
| White | 129229 |
| Dark Blue | 84476 |
| Light Beige | 48520 |
| Blue | 41798 |
| Other values (45) | 504253 |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 6.929918699 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8523800 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Light Orange |
|---|---|
| 2nd row | Dark Grey |
| 3rd row | White |
| 4th row | Black |
| 5th row | Dark Blue |
Common Values
| Value | Count | Frequency (%) |
| Black | 421724 | 34.3% |
| White | 129229 | 10.5% |
| Dark Blue | 84476 | 6.9% |
| Light Beige | 48520 | 3.9% |
| Blue | 41798 | 3.4% |
| Beige | 35672 | 2.9% |
| Light Blue | 35155 | 2.9% |
| Light Pink | 34866 | 2.8% |
| Grey | 33660 | 2.7% |
| Off White | 32689 | 2.7% |
| Other values (40) | 332211 | 27.0% |
Length
| Value | Count | Frequency (%) |
| black | 421724 | 24.9% |
| dark | 207702 | 12.2% |
| light | 170468 | 10.0% |
| white | 161918 | 9.5% |
| blue | 161767 | 9.5% |
| beige | 97814 | 5.8% |
| grey | 77534 | 4.6% |
| pink | 64611 | 3.8% |
| red | 59630 | 3.5% |
| green | 40087 | 2.4% |
| Other values (16) | 233117 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 910641 | 10.7% |
| k | 723650 | 8.5% |
| l | 699664 | 8.2% |
| B | 697896 | 8.2% |
| a | 692981 | 8.1% |
| i | 587631 | 6.9% |
| (space) | 466372 | 5.5% |
| r | 439913 | 5.2% |
| c | 421724 | 4.9% |
| h | 418158 | 4.9% |
| Other values (28) | 2465170 | 28.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6360208 | 74.6% |
| Uppercase Letter | 1696796 | 19.9% |
| Space Separator | 466372 | 5.5% |
| Other Punctuation | 424 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 910641 | |
| k | 723650 | |
| l | 699664 | |
| a | 692981 | |
| i | 587631 | |
| r | 439913 | |
| c | 421724 | |
| h | 418158 | |
| t | 341205 | 5.4% |
| g | 301707 | 4.7% |
| Other values (12) | 822934 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 697896 | |
| D | 207702 | 12.2% |
| L | 170468 | 10.0% |
| W | 161918 | 9.5% |
| G | 159851 | 9.4% |
| O | 74571 | 4.4% |
| P | 72333 | 4.3% |
| R | 59630 | 3.5% |
| Y | 46523 | 2.7% |
| K | 29406 | 1.7% |
| Other values (4) | 16498 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 466372 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 424 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8057004 | 94.5% |
| Common | 466796 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 910641 | |
| k | 723650 | 9.0% |
| l | 699664 | 8.7% |
| B | 697896 | 8.7% |
| a | 692981 | 8.6% |
| i | 587631 | 7.3% |
| r | 439913 | 5.5% |
| c | 421724 | 5.2% |
| h | 418158 | 5.2% |
| t | 341205 | 4.2% |
| Other values (26) | 2123541 |
Common
| Value | Count | Frequency (%) |
| (space) | 466372 | 99.9% |
| / | 424 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8523800 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 910641 | 10.7% |
| k | 723650 | 8.5% |
| l | 699664 | 8.2% |
| B | 697896 | 8.2% |
| a | 692981 | 8.1% |
| i | 587631 | 6.9% |
| (space) | 466372 | 5.5% |
| r | 439913 | 5.2% |
| c | 421724 | 4.9% |
| h | 418158 | 4.9% |
| Other values (28) | 2465170 | 28.9% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.262581301 |
| Minimum | -1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 207 |
| Negative (%) | < 0.1% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.421624465 |
|---|---|
| Coefficient of variation (CV) | 0.4357361039 |
| Kurtosis | 0.2053572724 |
| Mean | 3.262581301 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.07923424239 |
| Sum | 4012975 |
| Variance | 2.021016118 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 598176 | 48.6% |
| 1 | 211592 | 17.2% |
| 3 | 176942 | 14.4% |
| 2 | 145030 | 11.8% |
| 5 | 48759 | 4.0% |
| 7 | 48441 | 3.9% |
| 6 | 853 | 0.1% |
| -1 | 207 | < 0.1% |
| Value | Count | Frequency (%) |
| -1 | 207 | < 0.1% |
| 1 | 211592 | 17.2% |
| 2 | 145030 | 11.8% |
| 3 | 176942 | 14.4% |
| 4 | 598176 | 48.6% |
| 5 | 48759 | 4.0% |
| 6 | 853 | 0.1% |
| 7 | 48441 | 3.9% |
| Value | Count | Frequency (%) |
| 7 | 48441 | 3.9% |
| 6 | 853 | 0.1% |
| 5 | 48759 | 4.0% |
| 4 | 598176 | 48.6% |
| 3 | 176942 | 14.4% |
| 2 | 145030 | 11.8% |
| 1 | 211592 | 17.2% |
| -1 | 207 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Dark | 598176 |
|---|---|
| Dusty Light | 211592 |
| Light | 176942 |
| Medium Dusty | 145030 |
| Bright | 48759 |
| Other values (3) | 49501 |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 6.453343089 |
| Min length | 4 |
Characters and Unicode
| Total characters | 7937612 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Dusty Light |
|---|---|
| 2nd row | Dark |
| 3rd row | Light |
| 4th row | Dark |
| 5th row | Dark |
Common Values
| Value | Count | Frequency (%) |
| Dark | 598176 | 48.6% |
| Dusty Light | 211592 | 17.2% |
| Light | 176942 | 14.4% |
| Medium Dusty | 145030 | 11.8% |
| Bright | 48759 | 4.0% |
| Medium | 48441 | 3.9% |
| Undefined | 853 | 0.1% |
| Unknown | 207 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| dark | 598176 | 37.7% |
| light | 388534 | 24.5% |
| dusty | 356622 | 22.5% |
| medium | 193471 | 12.2% |
| bright | 48759 | 3.1% |
| undefined | 853 | 0.1% |
| unknown | 207 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 954798 | 12.0% |
| t | 793915 | 10.0% |
| r | 646935 | 8.2% |
| i | 631617 | 8.0% |
| k | 598383 | 7.5% |
| a | 598176 | 7.5% |
| u | 550093 | 6.9% |
| h | 437293 | 5.5% |
| g | 437293 | 5.5% |
| L | 388534 | 4.9% |
| Other values (13) | 1900575 | 23.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5994368 | 75.5% |
| Uppercase Letter | 1586622 | 20.0% |
| Space Separator | 356622 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 793915 | |
| r | 646935 | |
| i | 631617 | |
| k | 598383 | |
| a | 598176 | |
| u | 550093 | |
| h | 437293 | |
| g | 437293 | |
| y | 356622 | |
| s | 356622 | |
| Other values (7) | 587419 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 954798 | |
| L | 388534 | |
| M | 193471 | 12.2% |
| B | 48759 | 3.1% |
| U | 1060 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 356622 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7580990 | 95.5% |
| Common | 356622 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 954798 | |
| t | 793915 | |
| r | 646935 | |
| i | 631617 | |
| k | 598383 | 7.9% |
| a | 598176 | 7.9% |
| u | 550093 | 7.3% |
| h | 437293 | 5.8% |
| g | 437293 | 5.8% |
| L | 388534 | 5.1% |
| Other values (12) | 1543953 |
Common
| Value | Count | Frequency (%) |
| (space) | 356622 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7937612 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 954798 | 12.0% |
| t | 793915 | 10.0% |
| r | 646935 | 8.2% |
| i | 631617 | 8.0% |
| k | 598383 | 7.5% |
| a | 598176 | 7.5% |
| u | 550093 | 6.9% |
| h | 437293 | 5.5% |
| g | 437293 | 5.5% |
| L | 388534 | 4.9% |
| Other values (13) | 1900575 | 23.9% |
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.61538374 |
| Minimum | -1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 10434 |
| Negative (%) | 0.8% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 5 |
| Q3 | 11 |
| 95-th percentile | 19 |
| Maximum | 20 |
| Range | 21 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.061592998 |
|---|---|
| Coefficient of variation (CV) | 0.6646537025 |
| Kurtosis | 0.1165722332 |
| Mean | 7.61538374 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.9687374366 |
| Sum | 9366922 |
| Variance | 25.61972368 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 418691 | 34.0% |
| 9 | 162441 | 13.2% |
| 2 | 161946 | 13.2% |
| 11 | 79527 | 6.5% |
| 12 | 75152 | 6.1% |
| 4 | 63831 | 5.2% |
| 18 | 60190 | 4.9% |
| 19 | 36371 | 3.0% |
| 20 | 34489 | 2.8% |
| 8 | 29167 | 2.4% |
| Other values (10) | 108195 | 8.8% |
| Value | Count | Frequency (%) |
| -1 | 10434 | 0.8% |
| 1 | 12937 | 1.1% |
| 2 | 161946 | 13.2% |
| 3 | 27095 | 2.2% |
| 4 | 63831 | 5.2% |
| 5 | 418691 | 34.0% |
| 6 | 7660 | 0.6% |
| 7 | 9169 | 0.7% |
| 8 | 29167 | 2.4% |
| 9 | 162441 | 13.2% |
| Value | Count | Frequency (%) |
| 20 | 34489 | 2.8% |
| 19 | 36371 | 3.0% |
| 18 | 60190 | 4.9% |
| 16 | 7 | < 0.1% |
| 15 | 15829 | 1.3% |
| 14 | 853 | 0.1% |
| 13 | 24182 | 2.0% |
| 12 | 75152 | 6.1% |
| 11 | 79527 | 6.5% |
| 10 | 29 | < 0.1% |
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Black | 418691 |
|---|---|
| White | 162441 |
| Blue | 161946 |
| Beige | 79527 |
| Grey | 75152 |
| Other values (15) | 332243 |
Length
| Max length | 15 |
|---|---|
| Median length | 5 |
| Mean length | 4.954361789 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6093865 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Orange |
|---|---|
| 2nd row | Grey |
| 3rd row | White |
| 4th row | Black |
| 5th row | Blue |
Common Values
| Value | Count | Frequency (%) |
| Black | 418691 | 34.0% |
| White | 162441 | 13.2% |
| Blue | 161946 | 13.2% |
| Beige | 79527 | 6.5% |
| Grey | 75152 | 6.1% |
| Pink | 63831 | 5.2% |
| Red | 60190 | 4.9% |
| Green | 36371 | 3.0% |
| Khaki green | 34489 | 2.8% |
| Yellow | 29167 | 2.4% |
| Other values (10) | 108195 | 8.8% |
Length
| Value | Count | Frequency (%) |
| black | 418691 | 32.9% |
| white | 162441 | 12.8% |
| blue | 161946 | 12.7% |
| beige | 79527 | 6.3% |
| grey | 75152 | 5.9% |
| green | 70896 | 5.6% |
| pink | 63831 | 5.0% |
| red | 60190 | 4.7% |
| khaki | 34489 | 2.7% |
| yellow | 29167 | 2.3% |
| Other values (11) | 115855 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 864167 | 14.2% |
| B | 684353 | 11.2% |
| l | 683122 | 11.2% |
| k | 527445 | 8.7% |
| a | 503764 | 8.3% |
| c | 426351 | 7.0% |
| i | 358006 | 5.9% |
| n | 219012 | 3.6% |
| r | 214154 | 3.5% |
| h | 196966 | 3.2% |
| Other values (23) | 1416525 | 23.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4814837 | 79.0% |
| Uppercase Letter | 1236843 | 20.3% |
| Space Separator | 42185 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 864167 | |
| l | 683122 | |
| k | 527445 | |
| a | 503764 | |
| c | 426351 | |
| i | 358006 | |
| n | 219012 | 4.5% |
| r | 214154 | 4.4% |
| h | 196966 | 4.1% |
| u | 188804 | 3.9% |
| Other values (10) | 633046 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 684353 | |
| W | 162441 | 13.1% |
| G | 111559 | 9.0% |
| P | 71491 | 5.8% |
| R | 60190 | 4.9% |
| K | 34489 | 2.8% |
| Y | 29196 | 2.4% |
| M | 28766 | 2.3% |
| O | 27095 | 2.2% |
| U | 10434 | 0.8% |
| Other values (2) | 16829 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 42185 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6051680 | 99.3% |
| Common | 42185 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 864167 | |
| B | 684353 | |
| l | 683122 | |
| k | 527445 | 8.7% |
| a | 503764 | 8.3% |
| c | 426351 | 7.0% |
| i | 358006 | 5.9% |
| n | 219012 | 3.6% |
| r | 214154 | 3.5% |
| h | 196966 | 3.3% |
| Other values (22) | 1374340 |
Common
| Value | Count | Frequency (%) |
| (space) | 42185 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6093865 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 864167 | 14.2% |
| B | 684353 | 11.2% |
| l | 683122 | 11.2% |
| k | 527445 | 8.7% |
| a | 503764 | 8.3% |
| c | 426351 | 7.0% |
| i | 358006 | 5.9% |
| n | 219012 | 3.6% |
| r | 214154 | 3.5% |
| h | 196966 | 3.2% |
| Other values (23) | 1416525 | 23.2% |
| Distinct | 298 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2906.439874 |
| Minimum | 1201 |
|---|---|
| Maximum | 9989 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | 1201 |
|---|---|
| 5-th percentile | 1322 |
| Q1 | 1616 |
| median | 1717 |
| Q3 | 3948 |
| 95-th percentile | 8310 |
| Maximum | 9989 |
| Range | 8788 |
| Interquartile range (IQR) | 2332 |
Descriptive statistics
| Standard deviation | 2121.898698 |
|---|---|
| Coefficient of variation (CV) | 0.7300679837 |
| Kurtosis | 1.288360006 |
| Mean | 2906.439874 |
| Median Absolute Deviation (MAD) | 373 |
| Skewness | 1.519970575 |
| Sum | 3574921045 |
| Variance | 4502454.086 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4242 | 92729 | 7.5% |
| 1676 | 54663 | 4.4% |
| 1338 | 45629 | 3.7% |
| 1722 | 43805 | 3.6% |
| 1636 | 43741 | 3.6% |
| 1643 | 43577 | 3.5% |
| 1522 | 34567 | 2.8% |
| 1626 | 32437 | 2.6% |
| 1747 | 30775 | 2.5% |
| 1322 | 29159 | 2.4% |
| Other values (288) | 778918 | 63.3% |
| Value | Count | Frequency (%) |
| 1201 | 13673 | 1.1% |
| 1202 | 32 | < 0.1% |
| 1212 | 6803 | 0.6% |
| 1222 | 7763 | 0.6% |
| 1241 | 397 | < 0.1% |
| 1244 | 7565 | 0.6% |
| 1310 | 3506 | 0.3% |
| 1313 | 9541 | 0.8% |
| 1322 | 29159 | 2.4% |
| 1334 | 20912 | 1.7% |
| Value | Count | Frequency (%) |
| 9989 | 602 | < 0.1% |
| 9986 | 1268 | 0.1% |
| 9985 | 1541 | 0.1% |
| 9984 | 1600 | 0.1% |
| 9020 | 78 | < 0.1% |
| 8956 | 1656 | 0.1% |
| 8917 | 1645 | 0.1% |
| 8888 | 3627 | 0.3% |
| 8852 | 494 | < 0.1% |
| 8815 | 35 | < 0.1% |
| Distinct | 249 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Swimwear | 94329 |
|---|---|
| Trouser | 66248 |
| Blouse | 62716 |
| Knitwear | 60613 |
| Jersey | 57898 |
| Other values (244) | 888196 |
Length
| Max length | 40 |
|---|---|
| Median length | 34 |
| Mean length | 10.72118943 |
| Min length | 2 |
Characters and Unicode
| Total characters | 13187063 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Expressive Lingerie |
|---|---|
| 2nd row | Knitwear |
| 3rd row | Tops Fancy Jersey |
| 4th row | Young Girl Jersey Basic |
| 5th row | Knitwear |
Common Values
| Value | Count | Frequency (%) |
| Swimwear | 94329 | 7.7% |
| Trouser | 66248 | 5.4% |
| Blouse | 62716 | 5.1% |
| Knitwear | 60613 | 4.9% |
| Jersey | 57898 | 4.7% |
| Jersey Basic | 55688 | 4.5% |
| Expressive Lingerie | 45629 | 3.7% |
| Jersey fancy | 43741 | 3.6% |
| Basic 1 | 43577 | 3.5% |
| Dress | 42206 | 3.4% |
| Other values (239) | 657355 | 53.4% |
Length
| Value | Count | Frequency (%) |
| jersey | 235721 | 11.5% |
| basic | 134594 | 6.6% |
| swimwear | 96185 | 4.7% |
| fancy | 94223 | 4.6% |
| knitwear | 93269 | 4.5% |
| lingerie | 83901 | 4.1% |
| blouse | 69568 | 3.4% |
| trouser | 69107 | 3.4% |
| tops | 68281 | 3.3% |
| trousers | 64328 | 3.1% |
| Other values (132) | 1043029 | 50.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1699899 | 12.9% |
| s | 1414715 | 10.7% |
| r | 1252118 | 9.5% |
| i | 865421 | 6.6% |
| (space) | 822206 | 6.2% |
| a | 736622 | 5.6% |
| o | 628710 | 4.8% |
| n | 471326 | 3.6% |
| t | 462198 | 3.5% |
| y | 394158 | 3.0% |
| Other values (50) | 4439690 | 33.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10319293 | 78.3% |
| Uppercase Letter | 1946923 | 14.8% |
| Space Separator | 822206 | 6.2% |
| Other Punctuation | 48727 | 0.4% |
| Decimal Number | 45138 | 0.3% |
| Math Symbol | 4312 | < 0.1% |
| Dash Punctuation | 464 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1699899 | |
| s | 1414715 | |
| r | 1252118 | |
| i | 865421 | |
| a | 736622 | 7.1% |
| o | 628710 | 6.1% |
| n | 471326 | 4.6% |
| t | 462198 | 4.5% |
| y | 394158 | 3.8% |
| w | 342693 | 3.3% |
| Other values (16) | 2051433 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 309312 | |
| S | 296954 | |
| J | 253747 | |
| T | 215706 | |
| D | 158111 | |
| L | 150740 | |
| K | 119734 | 6.1% |
| F | 63771 | 3.3% |
| E | 62709 | 3.2% |
| W | 59850 | 3.1% |
| Other values (13) | 256289 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 44664 | |
| 5 | 314 | 0.7% |
| 6 | 80 | 0.2% |
| 2 | 78 | 0.2% |
| 7 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 28116 | |
| / | 20226 | |
| . | 385 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 822206 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4312 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 464 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12266216 | 93.0% |
| Common | 920847 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1699899 | |
| s | 1414715 | 11.5% |
| r | 1252118 | 10.2% |
| i | 865421 | 7.1% |
| a | 736622 | 6.0% |
| o | 628710 | 5.1% |
| n | 471326 | 3.8% |
| t | 462198 | 3.8% |
| y | 394158 | 3.2% |
| w | 342693 | 2.8% |
| Other values (39) | 3998356 |
Common
| Value | Count | Frequency (%) |
| (space) | 822206 | 89.3% |
| 1 | 44664 | 4.9% |
| & | 28116 | 3.1% |
| / | 20226 | 2.2% |
| + | 4312 | 0.5% |
| - | 464 | 0.1% |
| . | 385 | < 0.1% |
| 5 | 314 | < 0.1% |
| 6 | 80 | < 0.1% |
| 2 | 78 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13187063 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1699899 | 12.9% |
| s | 1414715 | 10.7% |
| r | 1252118 | 9.5% |
| i | 865421 | 6.6% |
| (space) | 822206 | 6.2% |
| a | 736622 | 5.6% |
| o | 628710 | 4.8% |
| n | 471326 | 3.6% |
| t | 462198 | 3.5% |
| y | 394158 | 3.0% |
| Other values (50) | 4439690 | 33.7% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| A | 496142 |
|---|---|
| D | 273290 |
| B | 212014 |
| C | 72260 |
| F | 70450 |
| Other values (5) | 105844 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1230000 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | A |
| 3rd row | D |
| 4th row | I |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 496142 | 40.3% |
| D | 273290 | 22.2% |
| B | 212014 | 17.2% |
| C | 72260 | 5.9% |
| F | 70450 | 5.7% |
| S | 47961 | 3.9% |
| I | 21369 | 1.7% |
| H | 18015 | 1.5% |
| G | 12828 | 1.0% |
| J | 5671 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| a | 496142 | 40.3% |
| d | 273290 | 22.2% |
| b | 212014 | 17.2% |
| c | 72260 | 5.9% |
| f | 70450 | 5.7% |
| s | 47961 | 3.9% |
| i | 21369 | 1.7% |
| h | 18015 | 1.5% |
| g | 12828 | 1.0% |
| j | 5671 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 496142 | 40.3% |
| D | 273290 | 22.2% |
| B | 212014 | 17.2% |
| C | 72260 | 5.9% |
| F | 70450 | 5.7% |
| S | 47961 | 3.9% |
| I | 21369 | 1.7% |
| H | 18015 | 1.5% |
| G | 12828 | 1.0% |
| J | 5671 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1230000 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 496142 | 40.3% |
| D | 273290 | 22.2% |
| B | 212014 | 17.2% |
| C | 72260 | 5.9% |
| F | 70450 | 5.7% |
| S | 47961 | 3.9% |
| I | 21369 | 1.7% |
| H | 18015 | 1.5% |
| G | 12828 | 1.0% |
| J | 5671 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1230000 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 496142 | 40.3% |
| D | 273290 | 22.2% |
| B | 212014 | 17.2% |
| C | 72260 | 5.9% |
| F | 70450 | 5.7% |
| S | 47961 | 3.9% |
| I | 21369 | 1.7% |
| H | 18015 | 1.5% |
| G | 12828 | 1.0% |
| J | 5671 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1230000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 496142 | 40.3% |
| D | 273290 | 22.2% |
| B | 212014 | 17.2% |
| C | 72260 | 5.9% |
| F | 70450 | 5.7% |
| S | 47961 | 3.9% |
| I | 21369 | 1.7% |
| H | 18015 | 1.5% |
| G | 12828 | 1.0% |
| J | 5671 | 0.5% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Ladieswear | 496142 |
|---|---|
| Divided | 273290 |
| Lingeries/Tights | 212014 |
| Ladies Accessories | 72260 |
| Menswear | 70450 |
| Other values (5) | 105844 |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 11.05249593 |
| Min length | 5 |
Characters and Unicode
| Total characters | 13594570 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Lingeries/Tights |
|---|---|
| 2nd row | Ladieswear |
| 3rd row | Divided |
| 4th row | Children Sizes 134-170 |
| 5th row | Ladieswear |
Common Values
| Value | Count | Frequency (%) |
| Ladieswear | 496142 | 40.3% |
| Divided | 273290 | 22.2% |
| Lingeries/Tights | 212014 | 17.2% |
| Ladies Accessories | 72260 | 5.9% |
| Menswear | 70450 | 5.7% |
| Sport | 47961 | 3.9% |
| Children Sizes 134-170 | 21369 | 1.7% |
| Children Sizes 92-140 | 18015 | 1.5% |
| Baby Sizes 50-98 | 12828 | 1.0% |
| Children Accessories, Swimwear | 5671 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| ladieswear | 496142 | 35.0% |
| divided | 273290 | 19.3% |
| lingeries/tights | 212014 | 15.0% |
| accessories | 77931 | 5.5% |
| ladies | 72260 | 5.1% |
| menswear | 70450 | 5.0% |
| sizes | 52212 | 3.7% |
| sport | 47961 | 3.4% |
| children | 45055 | 3.2% |
| 134-170 | 21369 | 1.5% |
| Other values (4) | 49342 | 3.5% |
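The word counts above can be reproduced from the ten values in the Common Values table for this variable. The sketch below assumes a simple tokenization: lowercase each value, split on whitespace, and trim punctuation from both ends of each token. That rule is inferred from the counts (for example, `accessories` matching the sum of the two values that contain it, while `lingeries/tights` and `134-170` stay whole); it is not confirmed by the report itself.

```python
import string
from collections import Counter

# Value counts copied from the Common Values table for this variable.
value_counts = {
    "Ladieswear": 496142,
    "Divided": 273290,
    "Lingeries/Tights": 212014,
    "Ladies Accessories": 72260,
    "Menswear": 70450,
    "Sport": 47961,
    "Children Sizes 134-170": 21369,
    "Children Sizes 92-140": 18015,
    "Baby Sizes 50-98": 12828,
    "Children Accessories, Swimwear": 5671,
}

word_counts = Counter()
for value, count in value_counts.items():
    # Assumed tokenization: lowercase, split on whitespace,
    # then strip punctuation from both ends of each token.
    for token in value.lower().split():
        word_counts[token.strip(string.punctuation)] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common(5):
    print(f"{word}: {count} ({100 * count / total_words:.1f}%)")
```

The percentages in the word table are shares of all tokens, not of observations, which is why they are lower than the matching rows in the Common Values table.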
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2161562 | 15.9% |
| i | 1931893 | 14.2% |
| s | 1348885 | 9.9% |
| d | 1160037 | 8.5% |
| a | 1153493 | 8.5% |
| r | 955224 | 7.0% |
| L | 780416 | 5.7% |
| w | 577934 | 4.3% |
| g | 424028 | 3.1% |
| n | 327519 | 2.4% |
| Other values (31) | 2773579 | 20.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11289218 | 83.0% |
| Uppercase Letter | 1577828 | 11.6% |
| Decimal Number | 269601 | 2.0% |
| Other Punctuation | 217685 | 1.6% |
| Space Separator | 188026 | 1.4% |
| Dash Punctuation | 52212 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2161562 | |
| i | 1931893 | |
| s | 1348885 | |
| d | 1160037 | |
| a | 1153493 | |
| r | 955224 | |
| w | 577934 | 5.1% |
| g | 424028 | 3.8% |
| n | 327519 | 2.9% |
| v | 273290 | 2.4% |
| Other values (10) | 975353 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 60753 | |
| 0 | 52212 | |
| 4 | 39384 | |
| 9 | 30843 | |
| 3 | 21369 | 7.9% |
| 7 | 21369 | 7.9% |
| 2 | 18015 | 6.7% |
| 5 | 12828 | 4.8% |
| 8 | 12828 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 780416 | |
| D | 273290 | 17.3% |
| T | 212014 | 13.4% |
| S | 105844 | 6.7% |
| A | 77931 | 4.9% |
| M | 70450 | 4.5% |
| C | 45055 | 2.9% |
| B | 12828 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 212014 | |
| , | 5671 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 188026 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52212 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12867046 | 94.6% |
| Common | 727524 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2161562 | |
| i | 1931893 | |
| s | 1348885 | |
| d | 1160037 | |
| a | 1153493 | |
| r | 955224 | |
| L | 780416 | 6.1% |
| w | 577934 | 4.5% |
| g | 424028 | 3.3% |
| n | 327519 | 2.5% |
| Other values (18) | 2046055 |
Common
| Value | Count | Frequency (%) |
| / | 212014 | 29.1% |
| (space) | 188026 | 25.8% |
| 1 | 60753 | 8.4% |
| 0 | 52212 | 7.2% |
| - | 52212 | 7.2% |
| 4 | 39384 | 5.4% |
| 9 | 30843 | 4.2% |
| 3 | 21369 | 2.9% |
| 7 | 21369 | 2.9% |
| 2 | 18015 | 2.5% |
| Other values (3) | 31327 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13594570 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2161562 | 15.9% |
| i | 1931893 | 14.2% |
| s | 1348885 | 9.9% |
| d | 1160037 | 8.5% |
| a | 1153493 | 8.5% |
| r | 955224 | 7.0% |
| L | 780416 | 5.7% |
| w | 577934 | 4.3% |
| g | 424028 | 3.1% |
| n | 327519 | 2.4% |
| Other values (31) | 2773579 | 20.4% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| 1 | 780416 |
|---|---|
| 2 | 273290 |
| 3 | 70450 |
| 4 | 57883 |
| 26 | 47961 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.038992683 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1277961 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 780416 | 63.4% |
| 2 | 273290 | 22.2% |
| 3 | 70450 | 5.7% |
| 4 | 57883 | 4.7% |
| 26 | 47961 | 3.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 780416 | 63.4% |
| 2 | 273290 | 22.2% |
| 3 | 70450 | 5.7% |
| 4 | 57883 | 4.7% |
| 26 | 47961 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 780416 | 61.1% |
| 2 | 321251 | 25.1% |
| 3 | 70450 | 5.5% |
| 4 | 57883 | 4.5% |
| 6 | 47961 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1277961 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 780416 | 61.1% |
| 2 | 321251 | 25.1% |
| 3 | 70450 | 5.5% |
| 4 | 57883 | 4.5% |
| 6 | 47961 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1277961 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 780416 | 61.1% |
| 2 | 321251 | 25.1% |
| 3 | 70450 | 5.5% |
| 4 | 57883 | 4.5% |
| 6 | 47961 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1277961 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 780416 | 61.1% |
| 2 | 321251 | 25.1% |
| 3 | 70450 | 5.5% |
| 4 | 57883 | 4.5% |
| 6 | 47961 | 3.8% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Ladieswear | 780416 |
|---|---|
| Divided | 273290 |
| Menswear | 70450 |
| Baby/Children | 57883 |
| Sport | 47961 |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.165100813 |
| Min length | 5 |
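The length statistics can be reproduced from the value counts alone. The sketch below assumes the five values and counts listed under Common Values for this variable and uses a lower weighted median; the names are illustrative, and this is not necessarily how the profiling tool computes these figures.

```python
# Values and counts as listed under Common Values for this variable.
value_counts = {
    "Ladieswear": 780416,
    "Divided": 273290,
    "Menswear": 70450,
    "Baby/Children": 57883,
    "Sport": 47961,
}

n_observations = sum(value_counts.values())

# (string length, number of rows with that length), sorted by length.
length_counts = sorted((len(value), count) for value, count in value_counts.items())

min_length = length_counts[0][0]
max_length = length_counts[-1][0]
total_characters = sum(length * count for length, count in length_counts)
mean_length = total_characters / n_observations


def lower_weighted_median(pairs, total):
    """Return the first length at which the cumulative count reaches half of total."""
    cumulative = 0
    for length, count in pairs:
        cumulative += count
        if 2 * cumulative >= total:
            return length
    raise ValueError("no data")


median_length = lower_weighted_median(length_counts, n_observations)

print(min_length, max_length, total_characters, mean_length, median_length)
```

The character total it produces is the same figure reported as Total characters for this variable.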
Characters and Unicode
| Total characters | 11273074 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ladieswear |
|---|---|
| 2nd row | Ladieswear |
| 3rd row | Divided |
| 4th row | Baby/Children |
| 5th row | Ladieswear |
Common Values
| Value | Count | Frequency (%) |
| Ladieswear | 780416 | 63.4% |
| Divided | 273290 | 22.2% |
| Menswear | 70450 | 5.7% |
| Baby/Children | 57883 | 4.7% |
| Sport | 47961 | 3.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| ladieswear | 780416 | 63.4% |
| divided | 273290 | 22.2% |
| menswear | 70450 | 5.7% |
| baby/children | 57883 | 4.7% |
| sport | 47961 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2032905 | 18.0% |
| a | 1689165 | 15.0% |
| d | 1384879 | 12.3% |
| i | 1384879 | 12.3% |
| r | 956710 | 8.5% |
| s | 850866 | 7.5% |
| w | 850866 | 7.5% |
| L | 780416 | 6.9% |
| D | 273290 | 2.4% |
| v | 273290 | 2.4% |
| Other values (13) | 795808 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9927308 | 88.1% |
| Uppercase Letter | 1287883 | 11.4% |
| Other Punctuation | 57883 | 0.5% |
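The category counts above can be rebuilt from the value counts. The sketch below assumes the categories are Unicode general categories as returned by Python's `unicodedata.category` (`Ll` for Lowercase Letter, `Lu` for Uppercase Letter, `Po` for Other Punctuation); the dictionary of values is copied from the Common Values table and the variable names are illustrative.

```python
import unicodedata
from collections import Counter

# Values and counts as listed under Common Values for this variable.
value_counts = {
    "Ladieswear": 780416,
    "Divided": 273290,
    "Menswear": 70450,
    "Baby/Children": 57883,
    "Sport": 47961,
}

# Each character contributes once per row in which its value occurs.
category_totals = Counter()
for value, count in value_counts.items():
    for ch in value:
        category_totals[unicodedata.category(ch)] += count

total_characters = sum(category_totals.values())

for category, total in category_totals.most_common():
    print(category, total, f"{100 * total / total_characters:.1f}%")
```

With these inputs the three totals match the Count column of the table, and their sum matches the reported Total characters.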
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2032905 | 20.5% |
| a | 1689165 | 17.0% |
| d | 1384879 | 14.0% |
| i | 1384879 | 14.0% |
| r | 956710 | 9.6% |
| s | 850866 | 8.6% |
| w | 850866 | 8.6% |
| v | 273290 | 2.8% |
| n | 128333 | 1.3% |
| b | 57883 | 0.6% |
| Other values (6) | 317532 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 780416 | 60.6% |
| D | 273290 | 21.2% |
| M | 70450 | 5.5% |
| B | 57883 | 4.5% |
| C | 57883 | 4.5% |
| S | 47961 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 57883 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11215191 | 99.5% |
| Common | 57883 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2032905 | 18.1% |
| a | 1689165 | 15.1% |
| d | 1384879 | 12.3% |
| i | 1384879 | 12.3% |
| r | 956710 | 8.5% |
| s | 850866 | 7.6% |
| w | 850866 | 7.6% |
| L | 780416 | 7.0% |
| D | 273290 | 2.4% |
| v | 273290 | 2.4% |
| Other values (12) | 737925 | 6.6% |
Common
| Value | Count | Frequency (%) |
| / | 57883 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11273074 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2032905 | 18.0% |
| a | 1689165 | 15.0% |
| d | 1384879 | 12.3% |
| i | 1384879 | 12.3% |
| r | 956710 | 8.5% |
| s | 850866 | 7.5% |
| w | 850866 | 7.5% |
| L | 780416 | 6.9% |
| D | 273290 | 2.4% |
| v | 273290 | 2.4% |
| Other values (13) | 795808 | 7.1% |
| Distinct | 57 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.82913659 |
| Minimum | 2 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 15 |
| median | 47 |
| Q3 | 60 |
| 95-th percentile | 66 |
| Maximum | 97 |
| Range | 95 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 23.08040963 |
|---|---|
| Coefficient of variation (CV) | 0.6266888602 |
| Kurtosis | -1.627263438 |
| Mean | 36.82913659 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | -0.01317993478 |
| Sum | 45299838 |
| Variance | 532.7053087 |
| Monotonicity | Not monotonic |
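Several of the figures above are tied together by simple identities, which gives a quick consistency check. The sketch below copies values from the quantile and descriptive statistics tables and assumes the usual definitions (range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean x number of observations); the variable names are illustrative.

```python
# Values copied from the tables above.
n_observations = 1_230_000  # from the dataset statistics; this column has no missing values
mean = 36.82913659
std_dev = 23.08040963
minimum, maximum = 2, 97
q1, q3 = 15, 60

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std_dev / mean               # Coefficient of variation (CV)
variance = std_dev ** 2           # Variance
total = mean * n_observations     # Sum

print(value_range, iqr, cv, variance, round(total))
```

The recomputed values agree with the reported ones up to the rounding of the inputs.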
| Value | Count | Frequency (%) |
| 15 | 221946 | 18.0% |
| 53 | 143474 | 11.7% |
| 60 | 92729 | 7.5% |
| 61 | 83901 | 6.8% |
| 11 | 80965 | 6.6% |
| 16 | 58506 | 4.8% |
| 51 | 49438 | 4.0% |
| 6 | 46281 | 3.8% |
| 5 | 44915 | 3.7% |
| 62 | 42374 | 3.4% |
| Other values (47) | 365471 | 29.7% |
| Value | Count | Frequency (%) |
| 2 | 21544 | 1.8% |
| 4 | 22 | < 0.1% |
| 5 | 44915 | 3.7% |
| 6 | 46281 | 3.8% |
| 8 | 9788 | 0.8% |
| 11 | 80965 | 6.6% |
| 14 | 4985 | 0.4% |
| 15 | 221946 | 18.0% |
| 16 | 58506 | 4.8% |
| 17 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 97 | 1861 | 0.2% |
| 82 | 2044 | 0.2% |
| 80 | 2309 | 0.2% |
| 79 | 5835 | 0.5% |
| 77 | 8561 | 0.7% |
| 76 | 6789 | 0.6% |
| 72 | 4039 | 0.3% |
| 71 | 68 | < 0.1% |
| 70 | 1165 | 0.1% |
| 66 | 28869 | 2.3% |
| Distinct | 56 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Womens Everyday Collection | 221946 |
|---|---|
| Divided Collection | 143474 |
| Womens Swimwear, beachwear | 92729 |
| Womens Lingerie | 83901 |
| Womens Tailoring | 80965 |
| Other values (51) | 606985 |
Length
| Max length | 30 |
|---|---|
| Median length | 26 |
| Mean length | 18.95364472 |
| Min length | 4 |
Characters and Unicode
| Total characters | 23312983 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Womens Lingerie |
|---|---|
| 2nd row | Womens Everyday Collection |
| 3rd row | Divided Collection |
| 4th row | Girls Underwear & Basics |
| 5th row | Womens Everyday Collection |
Common Values
| Value | Count | Frequency (%) |
| Womens Everyday Collection | 221946 | 18.0% |
| Divided Collection | 143474 | 11.7% |
| Womens Swimwear, beachwear | 92729 | 7.5% |
| Womens Lingerie | 83901 | 6.8% |
| Womens Tailoring | 80965 | 6.6% |
| Womens Everyday Basics | 58506 | 4.8% |
| Divided Basics | 49438 | 4.0% |
| Womens Casual | 46281 | 3.8% |
| Ladies H&M Sport | 44915 | 3.7% |
| Womens Nightwear, Socks & Tigh | 42374 | 3.4% |
| Other values (46) | 365471 | 29.7% |
Length
| Value | Count | Frequency (%) |
| womens | 739904 | 24.0% |
| collection | 365420 | 11.8% |
| everyday | 280452 | 9.1% |
| divided | 238577 | 7.7% |
| basics | 117818 | 3.8% |
| swimwear | 95346 | 3.1% |
| beachwear | 92729 | 3.0% |
| tailoring | 87204 | 2.8% |
| ladies | 84903 | 2.8% |
| lingerie | 83901 | 2.7% |
| Other values (49) | 899695 | 29.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2706043 | 11.6% |
| o | 1863408 | 8.0% |
| (space) | 1855949 | 8.0% |
| i | 1770964 | 7.6% |
| s | 1493935 | 6.4% |
| n | 1489102 | 6.4% |
| a | 1218824 | 5.2% |
| r | 1019824 | 4.4% |
| l | 992992 | 4.3% |
| m | 984603 | 4.2% |
| Other values (38) | 7917339 | 34.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18216520 | 78.1% |
| Uppercase Letter | 2941677 | 12.6% |
| Space Separator | 1855949 | 8.0% |
| Other Punctuation | 277107 | 1.2% |
| Math Symbol | 21544 | 0.1% |
| Decimal Number | 186 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2706043 | 14.9% |
| o | 1863408 | 10.2% |
| i | 1770964 | 9.7% |
| s | 1493935 | 8.2% |
| n | 1489102 | 8.2% |
| a | 1218824 | 6.7% |
| r | 1019824 | 5.6% |
| l | 992992 | 5.5% |
| m | 984603 | 5.4% |
| d | 935798 | 5.1% |
| Other values (12) | 3741027 | 20.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 739904 | 25.2% |
| C | 457760 | 15.6% |
| E | 287939 | 9.8% |
| D | 286680 | 9.7% |
| S | 280092 | 9.5% |
| B | 170310 | 5.8% |
| L | 169301 | 5.8% |
| T | 151862 | 5.2% |
| M | 126137 | 4.3% |
| H | 68710 | 2.3% |
| Other values (11) | 202982 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 139387 | 50.3% |
| , | 137720 | 49.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1855949 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 21544 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 186 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21158197 | 90.8% |
| Common | 2154786 | 9.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2706043 | 12.8% |
| o | 1863408 | 8.8% |
| i | 1770964 | 8.4% |
| s | 1493935 | 7.1% |
| n | 1489102 | 7.0% |
| a | 1218824 | 5.8% |
| r | 1019824 | 4.8% |
| l | 992992 | 4.7% |
| m | 984603 | 4.7% |
| d | 935798 | 4.4% |
| Other values (33) | 6682704 | 31.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 1855949 | 86.1% |
| & | 139387 | 6.5% |
| , | 137720 | 6.4% |
| + | 21544 | 1.0% |
| 2 | 186 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23312983 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2706043 | 11.6% |
| o | 1863408 | 8.0% |
| (space) | 1855949 | 8.0% |
| i | 1770964 | 7.6% |
| s | 1493935 | 6.4% |
| n | 1489102 | 6.4% |
| a | 1218824 | 5.2% |
| r | 1019824 | 4.4% |
| l | 992992 | 4.3% |
| m | 984603 | 4.2% |
| Other values (38) | 7917339 | 34.0% |
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1010.743174 |
| Minimum | 1001 |
|---|---|
| Maximum | 1025 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 1002 |
| Q1 | 1005 |
| median | 1010 |
| Q3 | 1017 |
| 95-th percentile | 1021 |
| Maximum | 1025 |
| Range | 24 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.562653437 |
|---|---|
| Coefficient of variation (CV) | 0.006492899092 |
| Kurtosis | -1.181688205 |
| Mean | 1010.743174 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.2385878965 |
| Sum | 1243214104 |
| Variance | 43.06842013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1005 | 200722 | 16.3% |
| 1002 | 126038 | 10.2% |
| 1017 | 114126 | 9.3% |
| 1009 | 112538 | 9.1% |
| 1018 | 96185 | 7.8% |
| 1010 | 95802 | 7.8% |
| 1003 | 91639 | 7.5% |
| 1013 | 81614 | 6.6% |
| 1019 | 69042 | 5.6% |
| 1016 | 48954 | 4.0% |
| Other values (11) | 193340 | 15.7% |
| Value | Count | Frequency (%) |
| 1001 | 15739 | 1.3% |
| 1002 | 126038 | 10.2% |
| 1003 | 91639 | 7.5% |
| 1005 | 200722 | 16.3% |
| 1006 | 2918 | 0.2% |
| 1007 | 28271 | 2.3% |
| 1008 | 15791 | 1.3% |
| 1009 | 112538 | 9.1% |
| 1010 | 95802 | 7.8% |
| 1011 | 8463 | 0.7% |
| Value | Count | Frequency (%) |
| 1025 | 28396 | 2.3% |
| 1023 | 8910 | 0.7% |
| 1021 | 28964 | 2.4% |
| 1020 | 29441 | 2.4% |
| 1019 | 69042 | 5.6% |
| 1018 | 96185 | 7.8% |
| 1017 | 114126 | 9.3% |
| 1016 | 48954 | 4.0% |
| 1014 | 2650 | 0.2% |
| 1013 | 81614 | 6.6% |
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Jersey Fancy | 200722 |
|---|---|
| Jersey Basic | 126038 |
| Under-, Nightwear | 114126 |
| Trousers | 112538 |
| Swimwear | 96185 |
| Other values (16) | 580391 |
Length
| Max length | 29 |
|---|---|
| Median length | 20 |
| Mean length | 10.7135935 |
| Min length | 5 |
Characters and Unicode
| Total characters | 13177720 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Under-, Nightwear |
|---|---|
| 2nd row | Knitwear |
| 3rd row | Jersey Fancy |
| 4th row | Jersey Basic |
| 5th row | Knitwear |
Common Values
| Value | Count | Frequency (%) |
| Jersey Fancy | 200722 | 16.3% |
| Jersey Basic | 126038 | 10.2% |
| Under-, Nightwear | 114126 | 9.3% |
| Trousers | 112538 | 9.1% |
| Swimwear | 96185 | 7.8% |
| Blouses | 95802 | 7.8% |
| Knitwear | 91639 | 7.5% |
| Dresses Ladies | 81614 | 6.6% |
| Accessories | 69042 | 5.6% |
| Trousers Denim | 48954 | 4.0% |
| Other values (11) | 193340 | 15.7% |
Length
| Value | Count | Frequency (%) |
| jersey | 326760 | 17.4% |
| fancy | 200722 | 10.7% |
| trousers | 161492 | 8.6% |
| basic | 126038 | 6.7% |
| under | 114126 | 6.1% |
| nightwear | 114126 | 6.1% |
| swimwear | 96185 | 5.1% |
| blouses | 95802 | 5.1% |
| knitwear | 91639 | 4.9% |
| dresses | 81614 | 4.3% |
| Other values (20) | 468274 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1854585 | 14.1% |
| s | 1705653 | 12.9% |
| r | 1340972 | 10.2% |
| a | 751116 | 5.7% |
| i | 708868 | 5.4% |
| (space) | 646778 | 4.9% |
| n | 537458 | 4.1% |
| y | 533318 | 4.0% |
| c | 502718 | 3.8% |
| o | 488336 | 3.7% |
| Other values (30) | 4107918 | 31.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10443472 | 79.3% |
| Uppercase Letter | 1850732 | 14.0% |
| Space Separator | 646778 | 4.9% |
| Other Punctuation | 122612 | 0.9% |
| Dash Punctuation | 114126 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1854585 | 17.8% |
| s | 1705653 | 16.3% |
| r | 1340972 | 12.8% |
| a | 751116 | 7.2% |
| i | 708868 | 6.8% |
| n | 537458 | 5.1% |
| y | 533318 | 5.1% |
| c | 502718 | 4.8% |
| o | 488336 | 4.7% |
| w | 413874 | 4.0% |
| Other values (13) | 1606574 | 15.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 329678 | |
| S | 226806 | |
| B | 224758 | |
| F | 200722 | |
| T | 190456 | |
| D | 149009 | |
| U | 129865 | 7.0% |
| N | 114126 | 6.2% |
| K | 94557 | 5.1% |
| L | 81614 | 4.4% |
| Other values (3) | 109141 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 114126 | 93.1% |
| / | 8486 | 6.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 646778 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 114126 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12294204 | 93.3% |
| Common | 883516 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1854585 | |
| s | 1705653 | |
| r | 1340972 | 10.9% |
| a | 751116 | 6.1% |
| i | 708868 | 5.8% |
| n | 537458 | 4.4% |
| y | 533318 | 4.3% |
| c | 502718 | 4.1% |
| o | 488336 | 4.0% |
| w | 413874 | 3.4% |
| Other values (26) | 3457306 |
Common
| Value | Count | Frequency (%) |
| (space) | 646778 | 73.2% |
| , | 114126 | 12.9% |
| - | 114126 | 12.9% |
| / | 8486 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13177720 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1854585 | 14.1% |
| s | 1705653 | 12.9% |
| r | 1340972 | 10.2% |
| a | 751116 | 5.7% |
| i | 708868 | 5.4% |
| (space) | 646778 | 4.9% |
| n | 537458 | 4.1% |
| y | 533318 | 4.0% |
| c | 502718 | 3.8% |
| o | 488336 | 3.7% |
| Other values (30) | 4107918 | 31.2% |
| Distinct | 36009 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 4475 |
| Missing (%) | 0.4% |
| Memory size | 18.8 MiB |
| High-waisted jeans in washed superstretch denim with a zip fly and button, fake front pockets, real back pockets and super-skinny legs. | 8443 |
|---|---|
| 5-pocket jeans in washed, superstretch denim with a regular waist, zip fly and button, and skinny legs. | 6015 |
| T-shirt in lightweight jersey with a rounded hem. Slightly longer at the back. | 5279 |
| Fully lined bikini bottoms with a mid waist and medium coverage at the back. | 4784 |
| Blouse in a soft weave with a narrow collar, concealed buttons down the front, long sleeves with buttoned cuffs and a rounded hem. | 4325 |
| Other values (36004) |
Length
| Max length | 698 |
|---|---|
| Median length | 441 |
| Mean length | 135.7135081 |
| Min length | 11 |
Characters and Unicode
| Total characters | 166320297 |
|---|---|
| Distinct characters | 92 |
| Distinct categories | 14 |
| Distinct scripts | 2 |
| Distinct blocks | 4 |
Unique
| Unique | 5777 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Lace push-up bra with underwired, moulded, padded cups for a larger bust and fuller cleavage. Narrow, adjustable shoulder straps and a narrow fastening at the back with two pairs of hooks and eyes. |
|---|---|
| 2nd row | Long polo-neck jumper in a soft knit with long raglan sleeves and ribbing at the cuffs and hem. |
| 3rd row | Cropped top in airy, fluted jersey with narrow, adjustable shoulder straps, buttons down the front and narrow, covered elastication and a tie detail at the hem. |
| 4th row | Long-sleeved dress in cotton jersey with a seam at the waist and bell-shaped skirt. |
| 5th row | Wide, long-sleeved jumper in a soft, rib knit containing some wool. |
Common Values
| Value | Count | Frequency (%) |
| High-waisted jeans in washed superstretch denim with a zip fly and button, fake front pockets, real back pockets and super-skinny legs. | 8443 | 0.7% |
| 5-pocket jeans in washed, superstretch denim with a regular waist, zip fly and button, and skinny legs. | 6015 | 0.5% |
| T-shirt in lightweight jersey with a rounded hem. Slightly longer at the back. | 5279 | 0.4% |
| Fully lined bikini bottoms with a mid waist and medium coverage at the back. | 4784 | 0.4% |
| Blouse in a soft weave with a narrow collar, concealed buttons down the front, long sleeves with buttoned cuffs and a rounded hem. | 4325 | 0.4% |
| T-shirt in soft jersey. | 3082 | 0.3% |
| Fine-knit trainer socks in a soft cotton blend. | 2855 | 0.2% |
| Lined, non-wired, triangle bikini top with a wide hem. Narrow, adjustable shoulder straps that can be fastened in different ways at the back and cups with removable inserts that shape the bust and provide good support. No fasteners. | 2786 | 0.2% |
| Ankle-length cigarette trousers in a stretch weave with a regular waist, concealed zip in one side, fake back pockets and tapered legs with slits at the hems. | 2699 | 0.2% |
| Round-necked T-shirt in soft cotton jersey. | 2686 | 0.2% |
| Other values (35999) | 1182571 | 96.1% |
| (Missing) | 4475 | 0.4% |
Length
| Value | Count | Frequency (%) |
| and | 1761811 | 6.3% |
| a | 1727004 | 6.2% |
| with | 1676940 | 6.0% |
| the | 1484849 | 5.3% |
| in | 1162727 | 4.2% |
| at | 921066 | 3.3% |
| back | 505927 | 1.8% |
| waist | 440039 | 1.6% |
| top | 357519 | 1.3% |
| front | 355346 | 1.3% |
| Other values (4514) | 17620823 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 26788763 | 16.1% |
| e | 14914610 | 9.0% |
| t | 13469186 | 8.1% |
| a | 11462837 | 6.9% |
| n | 9795860 | 5.9% |
| i | 9678927 | 5.8% |
| s | 9256646 | 5.6% |
| o | 7392234 | 4.4% |
| d | 6840616 | 4.1% |
| r | 6815278 | 4.1% |
| Other values (82) | 49905340 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 131338416 | 79.0% |
| Space Separator | 26788763 | 16.1% |
| Other Punctuation | 4064909 | 2.4% |
| Uppercase Letter | 2410808 | 1.4% |
| Dash Punctuation | 1359658 | 0.8% |
| Decimal Number | 323393 | 0.2% |
| Open Punctuation | 9757 | < 0.1% |
| Close Punctuation | 9757 | < 0.1% |
| Other Symbol | 8382 | < 0.1% |
| Final Punctuation | 5808 | < 0.1% |
| Other values (4) | 646 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14914610 | |
| t | 13469186 | 10.3% |
| a | 11462837 | 8.7% |
| n | 9795860 | 7.5% |
| i | 9678927 | 7.4% |
| s | 9256646 | 7.0% |
| o | 7392234 | 5.6% |
| d | 6840616 | 5.2% |
| r | 6815278 | 5.2% |
| h | 6302467 | 4.8% |
| Other values (19) | 35409755 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 529115 | |
| L | 263848 | |
| T | 242186 | |
| V | 179801 | 7.5% |
| F | 175722 | 7.3% |
| U | 103572 | 4.3% |
| C | 99846 | 4.1% |
| B | 86644 | 3.6% |
| A | 85612 | 3.6% |
| H | 82815 | 3.4% |
| Other values (17) | 561647 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2219772 | |
| , | 1815388 | |
| / | 25071 | 0.6% |
| & | 3411 | 0.1% |
| % | 969 | < 0.1% |
| : | 126 | < 0.1% |
| ' | 103 | < 0.1% |
| ! | 43 | < 0.1% |
| " | 20 | < 0.1% |
| ? | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 110261 | |
| 3 | 47091 | |
| 4 | 42421 | 13.1% |
| 1 | 30094 | 9.3% |
| 2 | 29494 | 9.1% |
| 0 | 25098 | 7.8% |
| 8 | 12978 | 4.0% |
| 6 | 10217 | 3.2% |
| 7 | 9086 | 2.8% |
| 9 | 6653 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1344936 | |
| – | 14712 | 1.1% |
| ‒ | 10 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 7597 | |
| ® | 756 | 9.0% |
| ° | 29 | 0.3% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 5781 | |
| ” | 27 | 0.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 3 | 60.0% |
| ‘ | 2 | 40.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 26788763 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 9757 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9757 | 100.0% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 608 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 29 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 4 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 133749224 | 80.4% |
| Common | 32571073 | 19.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14914610 | 11.2% |
| t | 13469186 | 10.1% |
| a | 11462837 | 8.6% |
| n | 9795860 | 7.3% |
| i | 9678927 | 7.2% |
| s | 9256646 | 6.9% |
| o | 7392234 | 5.5% |
| d | 6840616 | 5.1% |
| r | 6815278 | 5.1% |
| h | 6302467 | 4.7% |
| Other values (46) | 37820563 |
Common
| Value | Count | Frequency (%) |
| (space) | 26788763 | 82.2% |
| . | 2219772 | 6.8% |
| , | 1815388 | 5.6% |
| - | 1344936 | 4.1% |
| 5 | 110261 | 0.3% |
| 3 | 47091 | 0.1% |
| 4 | 42421 | 0.1% |
| 1 | 30094 | 0.1% |
| 2 | 29494 | 0.1% |
| 0 | 25098 | 0.1% |
| Other values (26) | 117755 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 166244942 | |
| None | 47223 | < 0.1% |
| Punctuation | 20535 | < 0.1% |
| Letterlike Symbols | 7597 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 26788763 | 16.1% |
| e | 14914610 | 9.0% |
| t | 13469186 | 8.1% |
| a | 11462837 | 6.9% |
| n | 9795860 | 5.9% |
| i | 9678927 | 5.8% |
| s | 9256646 | 5.6% |
| o | 7392234 | 4.4% |
| d | 6840616 | 4.1% |
| r | 6815278 | 4.1% |
| Other values (67) | 49829985 |
None
| Value | Count | Frequency (%) |
| ê | 38482 | |
| é | 7124 | 15.1% |
| ® | 756 | 1.6% |
| ½ | 608 | 1.3% |
| É | 212 | 0.4% |
| ° | 29 | 0.1% |
| ñ | 8 | < 0.1% |
| ´ | 4 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 14712 | |
| ’ | 5781 | 28.2% |
| ” | 27 | 0.1% |
| ‒ | 10 | < 0.1% |
| “ | 3 | < 0.1% |
| ‘ | 2 | < 0.1% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 7597 | 100.0% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 708745 |
| Missing (%) | 57.6% |
| Memory size | 18.8 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1563765 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 521255 | 42.4% |
| (Missing) | 708745 | 57.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 521255 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 521255 | 33.3% |
| . | 521255 | 33.3% |
| 0 | 521255 | 33.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1042510 | 66.7% |
| Other Punctuation | 521255 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 521255 | 50.0% |
| 0 | 521255 | 50.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 521255 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1563765 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 521255 | 33.3% |
| . | 521255 | 33.3% |
| 0 | 521255 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1563765 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 521255 | 33.3% |
| . | 521255 | 33.3% |
| 0 | 521255 | 33.3% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 716695 |
| Missing (%) | 58.3% |
| Memory size | 18.8 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1539915 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 513305 | 41.7% |
| (Missing) | 716695 | 58.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 513305 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 513305 | 33.3% |
| . | 513305 | 33.3% |
| 0 | 513305 | 33.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1026610 | 66.7% |
| Other Punctuation | 513305 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 513305 | 50.0% |
| 0 | 513305 | 50.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 513305 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1539915 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 513305 | 33.3% |
| . | 513305 | 33.3% |
| 0 | 513305 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1539915 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 513305 | 33.3% |
| . | 513305 | 33.3% |
| 0 | 513305 | 33.3% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2527 |
| Missing (%) | 0.2% |
| Memory size | 18.8 MiB |
| ACTIVE | 1199719 |
|---|---|
| PRE-CREATE | 27423 |
| LEFT CLUB | 331 |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.090173063 |
| Min length | 6 |
Characters and Unicode
| Total characters | 7475523 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACTIVE |
|---|---|
| 2nd row | ACTIVE |
| 3rd row | ACTIVE |
| 4th row | ACTIVE |
| 5th row | ACTIVE |
Common Values
| Value | Count | Frequency (%) |
| ACTIVE | 1199719 | 97.5% |
| PRE-CREATE | 27423 | 2.2% |
| LEFT CLUB | 331 | < 0.1% |
| (Missing) | 2527 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| active | 1199719 | 97.7% |
| pre-create | 27423 | 2.2% |
| left | 331 | < 0.1% |
| club | 331 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1282319 | 17.2% |
| C | 1227473 | 16.4% |
| T | 1227473 | 16.4% |
| A | 1227142 | 16.4% |
| I | 1199719 | 16.0% |
| V | 1199719 | 16.0% |
| R | 54846 | 0.7% |
| P | 27423 | 0.4% |
| - | 27423 | 0.4% |
| L | 662 | < 0.1% |
| Other values (4) | 1324 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7447769 | 99.6% |
| Dash Punctuation | 27423 | 0.4% |
| Space Separator | 331 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1282319 | 17.2% |
| C | 1227473 | 16.5% |
| T | 1227473 | 16.5% |
| A | 1227142 | 16.5% |
| I | 1199719 | 16.1% |
| V | 1199719 | 16.1% |
| R | 54846 | 0.7% |
| P | 27423 | 0.4% |
| L | 662 | < 0.1% |
| F | 331 | < 0.1% |
| Other values (2) | 662 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27423 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 331 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7447769 | 99.6% |
| Common | 27754 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1282319 | 17.2% |
| C | 1227473 | 16.5% |
| T | 1227473 | 16.5% |
| A | 1227142 | 16.5% |
| I | 1199719 | 16.1% |
| V | 1199719 | 16.1% |
| R | 54846 | 0.7% |
| P | 27423 | 0.4% |
| L | 662 | < 0.1% |
| F | 331 | < 0.1% |
| Other values (2) | 662 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| - | 27423 | 98.8% |
| (space) | 331 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7475523 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1282319 | 17.2% |
| C | 1227473 | 16.4% |
| T | 1227473 | 16.4% |
| A | 1227142 | 16.4% |
| I | 1199719 | 16.0% |
| V | 1199719 | 16.0% |
| R | 54846 | 0.7% |
| P | 27423 | 0.4% |
| - | 27423 | 0.4% |
| L | 662 | < 0.1% |
| Other values (4) | 1324 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5743 |
| Missing (%) | 0.5% |
| Memory size | 18.8 MiB |
| NONE | 701463 |
|---|---|
| Regularly | 522367 |
| Monthly | 427 |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 6.134450528 |
| Min length | 4 |
Characters and Unicode
| Total characters | 7510144 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NONE |
|---|---|
| 2nd row | Regularly |
| 3rd row | Regularly |
| 4th row | NONE |
| 5th row | Regularly |
Common Values
| Value | Count | Frequency (%) |
| NONE | 701463 | 57.0% |
| Regularly | 522367 | 42.5% |
| Monthly | 427 | < 0.1% |
| (Missing) | 5743 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| none | 701463 | 57.3% |
| regularly | 522367 | 42.7% |
| monthly | 427 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1402926 | 18.7% |
| l | 1045161 | 13.9% |
| O | 701463 | 9.3% |
| E | 701463 | 9.3% |
| y | 522794 | 7.0% |
| R | 522367 | 7.0% |
| e | 522367 | 7.0% |
| g | 522367 | 7.0% |
| u | 522367 | 7.0% |
| a | 522367 | 7.0% |
| Other values (6) | 524502 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4181498 | 55.7% |
| Uppercase Letter | 3328646 | 44.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1045161 | 25.0% |
| y | 522794 | 12.5% |
| e | 522367 | 12.5% |
| g | 522367 | 12.5% |
| u | 522367 | 12.5% |
| a | 522367 | 12.5% |
| r | 522367 | 12.5% |
| o | 427 | < 0.1% |
| n | 427 | < 0.1% |
| t | 427 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1402926 | 42.1% |
| O | 701463 | 21.1% |
| E | 701463 | 21.1% |
| R | 522367 | 15.7% |
| M | 427 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7510144 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1402926 | 18.7% |
| l | 1045161 | 13.9% |
| O | 701463 | 9.3% |
| E | 701463 | 9.3% |
| y | 522794 | 7.0% |
| R | 522367 | 7.0% |
| e | 522367 | 7.0% |
| g | 522367 | 7.0% |
| u | 522367 | 7.0% |
| a | 522367 | 7.0% |
| Other values (6) | 524502 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7510144 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1402926 | 18.7% |
| l | 1045161 | 13.9% |
| O | 701463 | 9.3% |
| E | 701463 | 9.3% |
| y | 522794 | 7.0% |
| R | 522367 | 7.0% |
| e | 522367 | 7.0% |
| g | 522367 | 7.0% |
| u | 522367 | 7.0% |
| a | 522367 | 7.0% |
| Other values (6) | 524502 | 7.0% |
| Distinct | 83 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5741 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.06006164 |
| Minimum | 16 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.8 MiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 25 |
| median | 31 |
| Q3 | 47 |
| 95-th percentile | 59 |
| Maximum | 99 |
| Range | 83 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 13.01484531 |
|---|---|
| Coefficient of variation (CV) | 0.3609213274 |
| Kurtosis | -0.6333019993 |
| Mean | 36.06006164 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.6609682013 |
| Sum | 44146855 |
| Variance | 169.3861985 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 60940 | 5.0% |
| 26 | 60857 | 4.9% |
| 24 | 57759 | 4.7% |
| 27 | 56890 | 4.6% |
| 23 | 52069 | 4.2% |
| 28 | 51533 | 4.2% |
| 29 | 47786 | 3.9% |
| 30 | 45030 | 3.7% |
| 22 | 43149 | 3.5% |
| 21 | 42828 | 3.5% |
| Other values (73) | 705418 | 57.4% |
| Value | Count | Frequency (%) |
| 16 | 69 | < 0.1% |
| 17 | 2407 | 0.2% |
| 18 | 7976 | 0.6% |
| 19 | 17206 | 1.4% |
| 20 | 30427 | 2.5% |
| 21 | 42828 | 3.5% |
| 22 | 43149 | 3.5% |
| 23 | 52069 | 4.2% |
| 24 | 57759 | 4.7% |
| 25 | 60940 | 5.0% |
| Value | Count | Frequency (%) |
| 99 | 3 | < 0.1% |
| 98 | 4 | < 0.1% |
| 97 | 1 | < 0.1% |
| 95 | 4 | < 0.1% |
| 94 | 5 | < 0.1% |
| 93 | 5 | < 0.1% |
| 92 | 9 | < 0.1% |
| 91 | 14 | < 0.1% |
| 90 | 12 | < 0.1% |
| 89 | 4 | < 0.1% |
| Distinct | 254541 |
|---|---|
| Distinct (%) | 20.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.8 MiB |
| Value | Count |
|---|---|
| 2c29ae653a9282cce4151bd87643c907644e09541abc28ae87dea0d1f6603b1c | 26486 |
| 5b7eb31eabebd3277de632b82267286d847fd5d44287ee150bb4206b48439145 | 229 |
| 7c1fa3b0ec1d37ce2c3f34f63bd792f3b4494f324b6be5d1e4ba6a75456b96a7 | 220 |
| 1f5bd429acc88fbbf24de844a59e438704aa8761bc7b99fd977cad297c50b74c | 206 |
| a5ca21aefc3cf90afd9b09faf3b0f8f3c423d4f1cfb4c2e33a1b86770e426fa8 | 206 |
| Other values (254536) | 1202653 |
Length
| Max length | 64 |
|---|---|
| Median length | 64 |
| Mean length | 64 |
| Min length | 64 |
Characters and Unicode
| Total characters | 78720000 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 64561 |
|---|---|
| Unique (%) | 5.2% |
Sample
| 1st row | 2fb862f50c007c58b21e045956ca10469feccd2dbe7ccc81e596dc5d926992f3 |
|---|---|
| 2nd row | f083eb09535f454fe68dfcae389b759cf9b71ff45271cb0a1a99996a9a6be1e6 |
| 3rd row | 624cb91bb12c602f4ce8165bb2b16af165ee85b4ee06add41f0d6a5fa61038d0 |
| 4th row | aa49a6081cd770489f90dfdf8677a1957110e505320d923bf4e913686569a8bb |
| 5th row | c84181c2d1711d7b9e76fcea1f045409a608320eaad5338d096196a92a8c1785 |
Common Values
| Value | Count | Frequency (%) |
| 2c29ae653a9282cce4151bd87643c907644e09541abc28ae87dea0d1f6603b1c | 26486 | 2.2% |
| 5b7eb31eabebd3277de632b82267286d847fd5d44287ee150bb4206b48439145 | 229 | < 0.1% |
| 7c1fa3b0ec1d37ce2c3f34f63bd792f3b4494f324b6be5d1e4ba6a75456b96a7 | 220 | < 0.1% |
| 1f5bd429acc88fbbf24de844a59e438704aa8761bc7b99fd977cad297c50b74c | 206 | < 0.1% |
| a5ca21aefc3cf90afd9b09faf3b0f8f3c423d4f1cfb4c2e33a1b86770e426fa8 | 206 | < 0.1% |
| 2790324c84cdb8ba471be2a199cfb5103bbe1ab10883a0312b6928b05d6ee6c4 | 176 | < 0.1% |
| a1959a16bf167858c93a66ec2a330644512b25fb10f97eee2058549885af4dbd | 173 | < 0.1% |
| 9d5787501bf1c77592156ba51eab13f4a2670c807686431a9e22a69090b02358 | 172 | < 0.1% |
| cc4ed85e30f4977dae47662ddc468cd2eec11472de6fac5ec985080fd92243c8 | 163 | < 0.1% |
| 3eb41c8511d4e04fc0f02452e6e15d206d0c0e9d0f25ff79aeeea7f62561d5a5 | 143 | < 0.1% |
| Other values (254531) | 1201826 | 97.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 4965077 | 6.3% |
| a | 4958163 | 6.3% |
| e | 4953247 | 6.3% |
| 6 | 4948682 | 6.3% |
| 4 | 4946499 | 6.3% |
| 2 | 4941698 | 6.3% |
| 1 | 4934616 | 6.3% |
| 8 | 4926926 | 6.3% |
| 9 | 4916596 | 6.2% |
| 0 | 4913973 | 6.2% |
| Other values (6) | 29314523 | 37.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49221446 | 62.5% |
| Lowercase Letter | 29498554 | 37.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 4948682 | 10.1% |
| 4 | 4946499 | 10.0% |
| 2 | 4941698 | 10.0% |
| 1 | 4934616 | 10.0% |
| 8 | 4926926 | 10.0% |
| 9 | 4916596 | 10.0% |
| 0 | 4913973 | 10.0% |
| 3 | 4902163 | 10.0% |
| 7 | 4899189 | 10.0% |
| 5 | 4891104 | 9.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4965077 | 16.8% |
| a | 4958163 | 16.8% |
| e | 4953247 | 16.8% |
| b | 4897737 | 16.6% |
| d | 4888303 | 16.6% |
| f | 4836027 | 16.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49221446 | 62.5% |
| Latin | 29498554 | 37.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 4948682 | 10.1% |
| 4 | 4946499 | 10.0% |
| 2 | 4941698 | 10.0% |
| 1 | 4934616 | 10.0% |
| 8 | 4926926 | 10.0% |
| 9 | 4916596 | 10.0% |
| 0 | 4913973 | 10.0% |
| 3 | 4902163 | 10.0% |
| 7 | 4899189 | 10.0% |
| 5 | 4891104 | 9.9% |
Latin
| Value | Count | Frequency (%) |
| c | 4965077 | 16.8% |
| a | 4958163 | 16.8% |
| e | 4953247 | 16.8% |
| b | 4897737 | 16.6% |
| d | 4888303 | 16.6% |
| f | 4836027 | 16.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78720000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 4965077 | 6.3% |
| a | 4958163 | 6.3% |
| e | 4953247 | 6.3% |
| 6 | 4948682 | 6.3% |
| 4 | 4946499 | 6.3% |
| 2 | 4941698 | 6.3% |
| 1 | 4934616 | 6.3% |
| 8 | 4926926 | 6.3% |
| 9 | 4916596 | 6.2% |
| 0 | 4913973 | 6.2% |
| Other values (6) | 29314523 | 37.2% |
Auto
The auto setting is an easily interpretable pairwise column metric that uses the following mapping (variable type pair : method): categorical-categorical : Cramér's V; numerical-categorical : Cramér's V (using a discretized numerical column); numerical-numerical : Spearman's ρ. This configuration uses the most suitable measure for each pair of columns.
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
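The rank-based calculation described above can be sketched in a few lines of standard-library Python. This is an illustrative sketch only, not the code the report generator runs; the helper names `average_ranks` and `spearman_rho` are hypothetical, and ties are assumed to receive the average of the ranks they span.

```python
from statistics import fmean, pstdev

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of positions i..j, converted to 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Covariance of the rank variables divided by the product of their std devs."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = fmean(rx), fmean(ry)
    cov = fmean([(a - mx) * (b - my) for a, b in zip(rx, ry)])
    return cov / (pstdev(rx) * pstdev(ry))

x = [1, 2, 3, 4, 5]
cubed = [v ** 3 for v in x]        # nonlinear but strictly increasing
reversed_x = [-v for v in x]       # strictly decreasing

print(spearman_rho(x, cubed))      # very close to +1
print(spearman_rho(x, reversed_x)) # very close to -1
```

Because only the ordering of the values enters the calculation, the cubic relationship scores as a perfect monotonic association even though it is not linear.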
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
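The definition and the invariance property can be checked with a small sketch. The function name `pearson_r` and the sample data are made up for illustration; the common 1/n factors in the covariance and the standard deviations cancel, so plain sums are used.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]

r = pearson_r(x, y)
# Shift and rescale each variable separately (positive scale factors).
r_transformed = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
# Two exact lines with very different slopes both give r close to 1.
r_shallow = pearson_r(x, [0.01 * a for a in x])
r_steep = pearson_r(x, [100 * a for a in x])

print(r, r_transformed, r_shallow, r_steep)
```

The invariance holds for positive scale factors; multiplying one variable by a negative constant flips the sign of r.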
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
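The pair-counting rule above corresponds to the variant usually called τ-a. A minimal sketch follows, assuming tied pairs count as neither concordant nor discordant; the function name `kendall_tau_a` and the data are hypothetical, and statistical libraries often default to a tie-corrected variant instead.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]

# Of the 10 pairs, 7 are concordant and 3 are discordant.
tau = kendall_tau_a(x, y)
print(tau)  # (7 - 3) / 10 = 0.4
```

This direct version compares every pair, so its cost grows quadratically with the number of observations; it is meant to illustrate the definition, not to be run on a dataset of this size.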
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | customer_id | article_id | price | sales_channel_id | sale | product_code | prod_name | product_type_no | product_type_name | product_group_name | graphical_appearance_no | graphical_appearance_name | colour_group_code | colour_group_name | perceived_colour_value_id | perceived_colour_value_name | perceived_colour_master_id | perceived_colour_master_name | department_no | department_name | index_code | index_name | index_group_no | index_group_name | section_no | section_name | garment_group_no | garment_group_name | detail_desc | FN | Active | club_member_status | fashion_news_frequency | age | postal_code |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | f05a521a2649a53841d0c5c837efb1d48e2eff7a6f6e47f94f0e21665d7adaa3 | 529008044 | 0.024288 | 2 | yes | 529008 | Hazelnut Push Melbourne | 306 | Bra | Underwear | 1010016 | Solid | 31 | Light Orange | 1 | Dusty Light | 3 | Orange | 1338 | Expressive Lingerie | B | Lingeries/Tights | 1 | Ladieswear | 61 | Womens Lingerie | 1017 | Under-, Nightwear | Lace push-up bra with underwired, moulded, padded cups for a larger bust and fuller cleavage. Narrow, adjustable shoulder straps and a narrow fastening at the back with two pairs of hooks and eyes. | NaN | NaN | ACTIVE | NONE | 34.0 | 2fb862f50c007c58b21e045956ca10469feccd2dbe7ccc81e596dc5d926992f3 |
| 1 | 58afa373cb889cda30831ba3ca728bbb4147d5c1f3d19060f003bf5713d7f4f5 | 537688014 | 0.040661 | 2 | yes | 537688 | Rachel | 252 | Sweater | Garment Upper body | 1010010 | Melange | 8 | Dark Grey | 4 | Dark | 12 | Grey | 1626 | Knitwear | A | Ladieswear | 1 | Ladieswear | 15 | Womens Everyday Collection | 1003 | Knitwear | Long polo-neck jumper in a soft knit with long raglan sleeves and ribbing at the cuffs and hem. | 1.0 | 1.0 | ACTIVE | Regularly | 29.0 | f083eb09535f454fe68dfcae389b759cf9b71ff45271cb0a1a99996a9a6be1e6 |
| 2 | 317ea97640e31f706565f2b61f17652ac569f05c1abc47fdf9fb2c4b446ca343 | 872298001 | 0.006085 | 1 | yes | 872298 | Bonina loose tank | 253 | Vest top | Garment Upper body | 1010016 | Solid | 10 | White | 3 | Light | 9 | White | 1640 | Tops Fancy Jersey | D | Divided | 2 | Divided | 53 | Divided Collection | 1005 | Jersey Fancy | Cropped top in airy, fluted jersey with narrow, adjustable shoulder straps, buttons down the front and narrow, covered elastication and a tie detail at the hem. | 1.0 | 1.0 | ACTIVE | Regularly | 40.0 | 624cb91bb12c602f4ce8165bb2b16af165ee85b4ee06add41f0d6a5fa61038d0 |
| 3 | 6559a47c9760bc36d3f7a7497306daa1ea9ce4a3a340a0abfe07325b76f4cd1e | 562455002 | 0.025407 | 2 | yes | 562455 | Edit fancy dress | 265 | Dress | Garment Full body | 1010001 | All over pattern | 9 | Black | 4 | Dark | 5 | Black | 7930 | Young Girl Jersey Basic | I | Children Sizes 134-170 | 4 | Baby/Children | 79 | Girls Underwear & Basics | 1002 | Jersey Basic | Long-sleeved dress in cotton jersey with a seam at the waist and bell-shaped skirt. | NaN | NaN | ACTIVE | NONE | 27.0 | aa49a6081cd770489f90dfdf8677a1957110e505320d923bf4e913686569a8bb |
| 4 | 10292f992bbf7a999f8f2eee6c1b2de299ee1279e369223b73c8baf6d65fce21 | 504154034 | 0.015237 | 2 | yes | 504154 | Lady Di | 252 | Sweater | Garment Upper body | 1010016 | Solid | 73 | Dark Blue | 4 | Dark | 2 | Blue | 1626 | Knitwear | A | Ladieswear | 1 | Ladieswear | 15 | Womens Everyday Collection | 1003 | Knitwear | Wide, long-sleeved jumper in a soft, rib knit containing some wool. | 1.0 | NaN | ACTIVE | Regularly | 61.0 | c84181c2d1711d7b9e76fcea1f045409a608320eaad5338d096196a92a8c1785 |
| 5 | fec78952f447ad721a39d19428132705b7fc4dfbda0d66a39b7ad439cbaae4e2 | 704150008 | 0.025407 | 2 | yes | 704150 | FUN FANCY CREW | 252 | Sweater | Garment Upper body | 1010002 | Application/3D | 71 | Light Blue | 1 | Dusty Light | 2 | Blue | 7648 | Kids Boy Jersey Fancy | H | Children Sizes 92-140 | 4 | Baby/Children | 46 | Kids Boy | 1005 | Jersey Fancy | Long-sleeved top in sweatshirt fabric with a motif on the front and ribbing around the neckline, cuffs and hem. | NaN | NaN | PRE-CREATE | NONE | 53.0 | 8a8480eb9fd1930ada443e904d3c1bf4eedd35f1647dd29bea6fc523f3083353 |
| 6 | c392502a2b7758c9504a2cd5ec6ce432a38a3863594b96727519cecc603a28e9 | 677511001 | 0.008458 | 1 | yes | 677511 | Basic Kjell bracelet pk | 68 | Bracelet | Accessories | 1010016 | Solid | 5 | Gold | 5 | Bright | 15 | Metal | 4344 | Jewellery | C | Ladies Accessories | 1 | Ladieswear | 66 | Womens Small accessories | 1019 | Accessories | Slightly elasticated metal bracelets. | NaN | NaN | ACTIVE | NONE | 24.0 | 0fc3491c64d73d2507f376cac6cadd23a3d012c3b148ad9b406052e4b98373d0 |
| 7 | 9f842fd2d47f3330c54b25c8128285953dc0fa0dc9c9df196bb278f811524ef5 | 863620007 | 0.016932 | 2 | yes | 863620 | Archie | 254 | Top | Garment Upper body | 1010017 | Stripe | 73 | Dark Blue | 4 | Dark | 2 | Blue | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | Straight-cut top in soft cotton jersey with a boat neck and long sleeves. | NaN | NaN | ACTIVE | NONE | 22.0 | fd6f02ef5d58f9a461a747461f01e65f7a13e2702e72d3abdc542ab333a5800e |
| 8 | fca73f5109fa6309a4db5330d1b1d2009309fe1d51ae3b95b754969aa34c68bc | 761221002 | 0.042356 | 1 | yes | 761221 | FRIDA PILE HOOD | 308 | Hoodie | Garment Upper body | 1010016 | Solid | 11 | Off White | 1 | Dusty Light | 9 | White | 1660 | Jersey | A | Ladieswear | 1 | Ladieswear | 6 | Womens Casual | 1005 | Jersey Fancy | Wide top in soft pile with a drawstring hood, kangaroo pocket, dropped shoulders and long sleeves. | 1.0 | 1.0 | ACTIVE | Regularly | 19.0 | 9873eb374893e0e8d4612ac48d077c17ee583c908c3743c4b893a59f689d4b8c |
| 9 | 82b3e74a6dedb8e12d55abfb0c773ff64f74aba6cb8dc35a05ae96f25e25f91e | 658298007 | 0.021356 | 2 | yes | 658298 | Skirt Mini | 275 | Skirt | Garment Lower body | 1010023 | Denim | 71 | Light Blue | 1 | Dusty Light | 2 | Blue | 1422 | Skirt | A | Ladieswear | 1 | Ladieswear | 15 | Womens Everyday Collection | 1012 | Skirts | 5-pocket skirt in washed denim with a high waist, button fly and frayed, raw-edge hem. | 1.0 | 1.0 | ACTIVE | Regularly | 29.0 | 3a47e8b067098201887b9d639fae33b39a5cbc376bcf238d6914162bb3d88678 |
Last rows
| | customer_id | article_id | price | sales_channel_id | sale | product_code | prod_name | product_type_no | product_type_name | product_group_name | graphical_appearance_no | graphical_appearance_name | colour_group_code | colour_group_name | perceived_colour_value_id | perceived_colour_value_name | perceived_colour_master_id | perceived_colour_master_name | department_no | department_name | index_code | index_name | index_group_no | index_group_name | section_no | section_name | garment_group_no | garment_group_name | detail_desc | FN | Active | club_member_status | fashion_news_frequency | age | postal_code |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1229990 | 3233f9c5309a304ed4aadad23a9095cb7910b3cee84960e9cdfe8ad3c215b86b | 830365003 | 0.033881 | 2 | yes | 830365 | Anais. | 258 | Blouse | Garment Upper body | 1010001 | All over pattern | 9 | Black | 4 | Dark | 5 | Black | 1522 | Blouse | A | Ladieswear | 1 | Ladieswear | 15 | Womens Everyday Collection | 1010 | Blouses | Blouse in woven fabric with a sweetheart neckline and long puff sleeves that are elasticated at the top and have a covered button at the cuffs. Gathered elastication at the front and down the sides and a smocked section at the back. Unlined. | NaN | NaN | ACTIVE | NONE | 22.0 | 1896679852dbefe6480542b1a4e52541bdf6869286d93757c2b5c76cb35f787a |
| 1229991 | 10fd376f323e87a41973f10321d8286364bfe036c05fe0af403a47da11276be6 | 554477026 | 0.013542 | 1 | yes | 554477 | Victoria Pull- On TRS | 272 | Trousers | Garment Lower body | 1010001 | All over pattern | 9 | Black | 4 | Dark | 5 | Black | 1747 | Trousers | D | Divided | 2 | Divided | 53 | Divided Collection | 1009 | Trousers | Ankle-length trousers in an airy viscose weave with a regular, elasticated waist, side pockets and slightly wider, tapered legs. | NaN | NaN | ACTIVE | NONE | 29.0 | 6f337cea300ee5fbf8edcc0c68ba3dbba67a33be5c18c069576a0182c6c8ce67 |
| 1229992 | c18088b5b9a7d6f0c26b266fd2797ae9124a3050315411741d1ed5da59e58041 | 756320020 | 0.033881 | 1 | yes | 756320 | Lindsay Sl-set (W) | 297 | Pyjama set | Nightwear | 1010001 | All over pattern | 9 | Black | 4 | Dark | 5 | Black | 3709 | Nightwear | B | Lingeries/Tights | 1 | Ladieswear | 62 | Womens Nightwear, Socks & Tigh | 1017 | Under-, Nightwear | Pyjama top and shorts in soft satin. V-neck cami top with adjustable spaghetti shoulder straps and lace at the top. Short shorts with narrow elastication at the waist and lace-trimmed hems. | 1.0 | 1.0 | ACTIVE | Regularly | 43.0 | 2c29ae653a9282cce4151bd87643c907644e09541abc28ae87dea0d1f6603b1c |
| 1229993 | 3bce1f7f7b4050e16553ae6a24e3a685f8fe754725bcb73bb283673a6fcbb697 | 753061003 | 0.008458 | 2 | yes | 753061 | Drake | 265 | Dress | Garment Full body | 1010017 | Stripe | 11 | Off White | 3 | Light | 9 | White | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | Short, sleeveless dress in soft cotton and modal jersey with a deep neckline and a narrow, elasticated seam at the waist. | 1.0 | 1.0 | ACTIVE | Regularly | 38.0 | 70f3c4ef6273aa575da3fdc5aed6e6441e14cf9717800d1a5d781dd88bffa6a0 |
| 1229994 | a78dfa6fe5f13c9e62dfe8b7c14b75fbabc9c41bbd39b59828a4bd80050a5dc8 | 866383003 | 0.020322 | 2 | yes | 866383 | Push it Push Bra. | 298 | Bikini top | Swimwear | 1010026 | Other structure | 50 | Other Pink | 5 | Bright | 4 | Pink | 4242 | Swimwear | B | Lingeries/Tights | 1 | Ladieswear | 60 | Womens Swimwear, beachwear | 1018 | Swimwear | Lined bikini top with padded cups for a larger bust and fuller cleavage. Wide shoulder straps and a metal fastener at the back. | NaN | NaN | ACTIVE | NONE | 24.0 | decd492b4b79d4648348cafcf8328556af70bdf8f1a0a0197bdd9d71fe561411 |
| 1229995 | b8f97d5de0b32a78f4c66349001852443047790b3d6b5a4c228d38f2b1f833a5 | 664405002 | 0.012576 | 1 | yes | 664405 | Virgo Hip belt | 67 | Belt | Accessories | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 3509 | Belts | C | Ladies Accessories | 1 | Ladieswear | 65 | Womens Big accessories | 1019 | Accessories | Belt with a metal buckle. Width 2.5 cm. | NaN | NaN | ACTIVE | NONE | 54.0 | cba3b70e9265ee425109d7d8e26abe9438a7f9c74f94dcdddcf15904c7d6496f |
| 1229996 | ad0c35bb8dae968d35a52c3ef2eac21fc09cceef33a0f3872918d7c88abcd829 | 685816002 | 0.008458 | 2 | yes | 685816 | RONNY REG RN T-SHIRT | 255 | T-shirt | Garment Upper body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 5832 | Light Basic Jersey | F | Menswear | 3 | Menswear | 26 | Men Underwear | 1002 | Jersey Basic | Round-necked T-shirt in soft cotton jersey. | NaN | NaN | ACTIVE | NONE | 22.0 | 14b2909c3d1349c6cb31518cbe321863385799052b4b51b28ab0171ef410aed0 |
| 1229997 | 11ede6d0f206c2b22c6e0aaedcd7018843152cbbcb76996f98db97c5a73ecad9 | 717874003 | 0.042356 | 2 | yes | 717874 | Swift Padded Swimsuit | 57 | Swimsuit | Swimwear | 1010017 | Stripe | 10 | White | 3 | Light | 9 | White | 4242 | Swimwear | B | Lingeries/Tights | 1 | Ladieswear | 60 | Womens Swimwear, beachwear | 1018 | Swimwear | Fully lined swimsuit with a V-neck, narrow adjustable shoulder straps and cups with removable inserts that shape the bust and provide good support. | 1.0 | 1.0 | ACTIVE | Regularly | 21.0 | cc35d8ab7dea6c9d48f1d61c7129f1e626468761dfbdf6904c6afe113e4a7093 |
| 1229998 | 1fae3e0134069f937b6edf6a2fd974fd2f70bcda4921a622f06cd4ce605702d5 | 351484027 | 0.017610 | 2 | yes | 351484 | Lazer Razer Brief | 59 | Swimwear bottom | Swimwear | 1010016 | Solid | 42 | Red | 5 | Bright | 18 | Red | 4242 | Swimwear | B | Lingeries/Tights | 1 | Ladieswear | 60 | Womens Swimwear, beachwear | 1018 | Swimwear | Fully lined bikini bottoms with a mid waist, medium coverage at the back and laser-cut, scalloped edges. | NaN | NaN | ACTIVE | NONE | 22.0 | c16e64df118ea27c2d8d57d218832b849c532de79383e9453f032613b25abebe |
| 1229999 | 45f48c39c7f5bbd3f9ee57cc6b424658861e730975cbcb736930c5b4ae8063cf | 674010009 | 0.042356 | 2 | yes | 674010 | SPEED Veronica dress w | 265 | Dress | Garment Full body | 1010016 | Solid | 42 | Red | 5 | Bright | 18 | Red | 1344 | Dresses | D | Divided | 2 | Divided | 53 | Divided Collection | 1013 | Dresses Ladies | Short, V-neck dress with a wrapover front, short flounced sleeves, a concealed fastening at the top and seam with a tie belt at the waist. Unlined. | NaN | NaN | ACTIVE | NONE | 28.0 | 26d9be9873c93e80209643b1eb382f5d10c378363b0a601d5cf755871daaf601 |
Most frequently occurring
| | customer_id | article_id | price | sales_channel_id | sale | product_code | prod_name | product_type_no | product_type_name | product_group_name | graphical_appearance_no | graphical_appearance_name | colour_group_code | colour_group_name | perceived_colour_value_id | perceived_colour_value_name | perceived_colour_master_id | perceived_colour_master_name | department_no | department_name | index_code | index_name | index_group_no | index_group_name | section_no | section_name | garment_group_no | garment_group_name | detail_desc | FN | Active | club_member_status | fashion_news_frequency | age | postal_code | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5573 | d00063b94dcb1342869d4994844a2742b5d62927f36843164fb3f818f630bca9 | 678342001 | 0.006763 | 1 | yes | 678342 | Lima SS. | 255 | T-shirt | Garment Upper body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1643 | Basic 1 | D | Divided | 2 | Divided | 51 | Divided Basics | 1002 | Jersey Basic | Fitted T-shirt in soft cotton jersey with a slightly wider neckline with a narrow ribbed trim. | NaN | NaN | ACTIVE | NONE | 27.0 | ecfb1e6aed8dde7c46c955c26185c51a1c21ca5ad6f819febb877c2506138204 | 26 |
| 3996 | 94665b46e194622ccdbcadc0170f13a2f8ede1ff6d057d43a19b8938c808b662 | 629420001 | 0.008458 | 2 | yes | 629420 | claudine | 255 | T-shirt | Garment Upper body | 1010016 | Solid | 10 | White | 3 | Light | 9 | White | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | T-shirt in soft cotton jersey. | NaN | NaN | ACTIVE | NONE | 23.0 | 45a5d77c5dc765f23b4ce8b38f30da63359061f8fb0b09e1cd5ac3c0398ade40 | 9 |
| 3846 | 8f5f1e993eff204ca7206cabe0fc6dfb75759994cacbf4c32c84ec5699a51c5d | 189634001 | 0.013542 | 2 | yes | 189634 | Long Leg Leggings | 273 | Leggings/Tights | Garment Lower body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1643 | Basic 1 | D | Divided | 2 | Divided | 51 | Divided Basics | 1002 | Jersey Basic | Leggings in stretch jersey with an elasticated waist. | NaN | NaN | PRE-CREATE | NONE | NaN | 8a9ef31b5300ef3aaee31d794ac23c8294ac33042463da92e91b26274257c7f8 | 7 |
| 6358 | ef38ec0f0cb29ee8bbb87efc82fd16f4b99127e3eeefe69c9b5fce627e93e270 | 570002001 | 0.012186 | 1 | yes | 570002 | ROY SLIM RN T-SHIRT | 255 | T-shirt | Garment Upper body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 5832 | Light Basic Jersey | F | Menswear | 3 | Menswear | 26 | Men Underwear | 1002 | Jersey Basic | Round-necked T-shirt in soft jersey. | NaN | NaN | ACTIVE | NONE | 24.0 | 2c29ae653a9282cce4151bd87643c907644e09541abc28ae87dea0d1f6603b1c | 7 |
| 1360 | 31db71ea558704fd429f0c9bb7f76475bd73577c9bf668d39260f31b9bfbce12 | 728162001 | 0.008458 | 2 | yes | 728162 | Talia (1) | 255 | T-shirt | Garment Upper body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | V-neck T-shirt in cotton jersey with a rounded hem. Slightly longer at the back. | NaN | NaN | ACTIVE | NONE | 55.0 | 0f1582b0c7c263c53a0ab88d134a80c9db6147455e1f94475c09ccc0bced2d4c | 6 |
| 3802 | 8de98d98789e2d90eb7d8b3b631e0a7d895aba860124a20e26c388998437b757 | 828047002 | 0.013542 | 2 | yes | 828047 | Blossom tee | 255 | T-shirt | Garment Upper body | 1010016 | Solid | 10 | White | 3 | Light | 9 | White | 1676 | Jersey Basic | A | Ladieswear | 1 | Ladieswear | 16 | Womens Everyday Basics | 1002 | Jersey Basic | Fitted, round-necked T-shirt in ribbed organic cotton jersey. | NaN | NaN | ACTIVE | NONE | 37.0 | 3f38900bac4f9881cb27273e8085e7b9ba1943d7a1d4c6e52965e40c23d6c9ad | 6 |
| 5527 | ce79a54991bb7c2c2d9427ae1e7f1d8c8b037f8d74b2fe659e87ad70e73ca6e7 | 570004009 | 0.016932 | 2 | yes | 570004 | PETER POLO | 257 | Polo shirt | Garment Upper body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 5832 | Light Basic Jersey | F | Menswear | 3 | Menswear | 26 | Men Underwear | 1002 | Jersey Basic | Short-sleeved polo shirt in soft jersey with a collar and button placket. | 1.0 | 1.0 | ACTIVE | Regularly | 32.0 | 40d74bff9dfc6f518a5e5ae8154c901606a2eb80a24b47c8121dd537cea03a70 | 6 |
| 123 | 04dab48e5805e9c05272604ac78eb5eb941850ce307a7dd4bb5fe4652c0e4915 | 695544001 | 0.033881 | 2 | yes | 695544 | Pluto slacks RW | 272 | Trousers | Garment Lower body | 1010016 | Solid | 9 | Black | 4 | Dark | 5 | Black | 1722 | Trouser | A | Ladieswear | 1 | Ladieswear | 15 | Womens Everyday Collection | 1009 | Trousers | Ankle-length cigarette trousers in stretch satin made from a cotton blend with a zip fly, concealed hook-and-eye fasteners and a regular waist with concealed elastication. Side pockets, fake welt back pockets and tapered legs with creases. | NaN | NaN | ACTIVE | NONE | 56.0 | 2514309f2126697aa9611fa8ad638c89b6b6cbae3ce55ebcbf886bc70e00e224 | 5 |
| 538 | 1472c551f2c04873edddc853e214a033692b2b1d6ae2bb14f369d633202f1980 | 852521001 | 0.030492 | 2 | yes | 852521 | MALOU CREW | 252 | Sweater | Garment Upper body | 1010008 | Front print | 11 | Off White | 1 | Dusty Light | 9 | White | 1660 | Jersey | A | Ladieswear | 1 | Ladieswear | 6 | Womens Casual | 1005 | Jersey Fancy | Boxy top in sweatshirt fabric with a motif on the front, low dropped shoulders and long sleeves with decorative seams. Ribbing around the neckline, cuffs and hem. Soft brushed inside. | NaN | NaN | ACTIVE | NONE | 48.0 | 45da989c2203268d20ff768b1c723c1f97fcbacd2daf67648ad3d89ae25bcadd | 5 |
| 2655 | 61da44a2758206d5701771f4315637b40c8321b511191654fb1430a6408e4dfa | 507909001 | 0.021593 | 1 | yes | 507909 | Rebecca or Delphine shirt | 259 | Shirt | Garment Upper body | 1010016 | Solid | 10 | White | 3 | Light | 9 | White | 1515 | Blouse | A | Ladieswear | 1 | Ladieswear | 11 | Womens Tailoring | 1010 | Blouses | Gently tailored shirt in a stretch cotton blend with a turn-down collar, V-neck, buttons down the front and buttoned cuffs. | 1.0 | 1.0 | ACTIVE | Regularly | 29.0 | fce375cd69ffecf89cbc59ce8ee3b69436c86c469adc269b6305f7607e6006e6 | 5 |